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(57) Abstract: The present invention relates to isolated nucleic acid molecules which restore fertility to cytoplasmic male sterile 
plants and modify expression of toxic mitochondria proteins by the plant. The present invention also relates to methods of identifying 
a candidate plant suitable for breeding with a cytoplasmic male sterile plant and methods of identifying a candidate gene restoring 
fertility in plants by analyzing for the candidate plant and candidate gene, respectively, for the presence of the nucleic acid molecule 
of the present invention. Also disclosed are methods of producing hybrid plant seed, methods of directing gene expression to plant 
mitochondria, and method of expressing a gene preferentially in roots of a plant. Promoters and terminators from plant genes which 
restore fertility to cytoplasmic male sterile plants and modify expression of toxic mitochondria proteins are also disclosed. Finally, 
methods of producing plants with a cytoplasmic male sterile plant restoration system are disclosed. 
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GENES FOR ALTERING MITOCHONDRIAL FUNCTION 
AND FOR HYBRID SEED PRODUCTION 

[0001] This application claims the benefit of U.S. Patent Application Serial 

5 No. 60/347,996, filed January 10, 2002, which is hereby incorporated by reference in 
its entirety. 

[0002] This invention arose out of research sponsored by the USDA NRI 

(Grant No. 98-35300-6171). The U.S. Government may have certain rights in this 
invention. 

10 

FIELD OF THE INVENTION 

[0003] The present invention relates to improving productivity or usefulness 

of plants by altering mitochondrial gene expression and to the production of hybrid 
seed. Specifically, the present invention relates to the use of genes that affect 
15 mitochondrial gene expression, some of which ameliorate male sterility and others 
which cause male sterility or altered floral development. The invention also provides 
a method of facilitating the identification of genes with similar functions in other 
plant species. 

20 BACKGROUND OF THE INVENTION 

[0004] A widely used method for producing hybrid seeds involves crossing a 

cytoplasmic male sterile (CMS) plant line with a fertile plant line. Typically, the 
fertile line contains a fertility restorer gene in its nuclear genome, so that all of the 
progeny are male fertile. All seeds collected from a CMS plant must result from 

25 cross-pollination. However, the hybrid seed so generated will itself be male sterile 
unless the male parent has brought a nuclear fertility-restorer gene into the next 
generation. The fertility of the progeny is important for productivity in plant varieties 
where self-pollination is responsible for production of the desirable crop. For 
example, a fruit crop of a self-pollinated species requires male fertility, while an 

30 ornamental species will produce attractive flowers or plant morphology even when no 
pollen is produced. 
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[0005] While a number of naturally occurring CMS/restorer systems exist and 

are currently in use for hybrid seed production, there are a number of crop species 
which lack known CMS and fertility restorer genes. For example, a hybrid seed of 
tomato is typically made by hand emasculation of plants to be used as female parents. 
5 This hand-made method of cross-pollination is quite labor intensive and cost- 
prohibitive for many crops. In addition, certain naturally occurring CMS/restorer 
systems have some drawbacks. For example, corn plants carrying the CMS-T 
cytoplasm are more susceptible to a blight disease. 

[0006] Fertility restorer genes that have been particularly useful for hybrid 

10 seed production are active as single dominant alleles at a locus, though multigenic 

systems are sometimes used. A Petunia fertility restorer locus termed Rf is known to 
be effective with no additional helper genes to restore fertility (Edwardson et al., 
"Fertility Restoration in Cytoplasmic Male Sterile Petunia" J. Hered., 58:195-196 
(1967); Izhar, "Cytoplasmic Male Sterility in Petunia. III. Genetic Control on 

15 Microsporogenesis and Male Fertility Restoration," J. Hered., 69:22-26 (1 978)). 

[0007] Nuclear fertility restoration genes confer normal pollen development 

upon plants carrying sterility-encoding mitochondria. The mitochondrial genes 
responsible for causing the male sterility have been identified in a number of species, 
including Petunia, maize, Brassica, and common bean. The expression of these 

20 CMS-encoding mitochondrial genes is affected by the nuclear restorer genes, as 

shown for Rf in Petunia (Pruitt et al., "Cytochrome Oxidase Subunit II Sequences in 
Petunia Mitochondria: Two Intron-Containing Genes and an Intron-Less Pseudogene 
Associated With Cytoplasmic Male Sterility," Curr. Genet, 16:281-91 (1989); 
Nivison et al., "Identification of a Mitochondrial Protein Associated With 

25 Cytoplasmic Male Sterility in Petunia," Plant Cell, 1:1121-30 (1989); Nivision et al., 
"Sequencing, Processing, and Localization of the Petunia CMS -Associated 
Mitochondrial Protein," Plant J., 5:613-623 (1994); Hanson et al., "Mitochondrial 
Gene Organization and Expression in Petunia Male Fertile and Sterile Plants," i_ 
Hered., 90:362-368 (1999)); Rfl in CMS-T maize (Dewey et al., "Novel 

30 Recombinations in the Maize Mitochondrial Genome Produce a Unique 

Transcriptional Unit in the Texas Male-Sterile Cytoplasm " CelU 44:439-49 (1986); 
Wise et al., "Mitochondrial Transcript Processing and Restoration of Male Fertility in 
T-Cytoplasm Maize/' J Hered, 90:380-385 (1999); Kennell et al., "Influence of 
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Nuclear Background on Transcription of a Maize Mitochondrial Region Associated 
With Texas Male Sterile Cytoplasm," Mol. Gen. Genet.. 210:399-406 (1987); Kennell 
et al., "Initiation and Processing of atp6, T-urfJ3, and ORF221 Transcripts From 
Mitochondria of T Cytoplasm Maize," Mol. Gen. Genet.. 216:16-24 (1989)); Rfpl and 
5 rfpl in Brassica (Singh et al., Suppression of Cytoplasmic Male Sterility by Nuclear 
Genes Alters Expression of a Novel Mitochondrial Gene Region," Plant Cell, 3:1349- 
1362 (1991); Singh et al., "Nuclear Genes Associated With a Single Brassica CMS 
Restorer Locus Influence Transcripts of Three Different Mitochondrial Gene 
Regions," Genetics. 143:505-516 (1996)); restorers in radish (Krishnasamy et al., 
10 "Organ-Specific Reduction in the Abundance of a Mitochondrial Protein 

Accompanies Fertility Restoration in Cytoplasmic Male-Sterile Radish," Plant Molec. 
Biol., 26:935-946 (1994)); restorers in sunflower (Horn et al., "A Mitochondrial 16 
kDa Protein is Associated With Cytoplasmic Male Sterility in Sunflower," Plant 
Molec. Biol.. 17:29-36 (1991); Laver et al., "Mitochondrial Genome Organization and 
15 Expression Associated With Cytoplasmic Male Sterility in Sunflower (Helianthus 
annuus)," Plant J.. 1:185-193 (1991); Moneger et al., "Nuclear Restoration of 
Cytoplasmic Male Sterility in Sunflower is Associated With the Tissue-Specific 
Regulation of a Novel Mitochondrial Gene," EMBO J.. 13:8-17 (1994); Smart et al., 
"Cell-Specific Regulation of Gene Expression in Mitochondria During Anther 
20 Development in Sunflower," Plant Cell. 6:81 1-825 (1994)); restorers in rice (Akagi et 
al., "A Unique Sequence Located Downstream From the Rice Mitochondrial atp6 
May Cause Male Sterility," Curr. Genet.. 25:52-58 (1994); Kadowaki et al., "A 
Chimeric Gene Containing the 5' Portion of atp6 is Associated With Cytoplasmic 
Male Sterility of Rice," Mol. Gen. Genet.. 224:10-16 (1990)); and Fr2 in broad bean 
25 (Chase, "Expression of CMS-Unique and Flanking Mitochondrial DNA Sequencs in 
Phaseolus vulgaris," L. Curr. Genet., 25:245-251 (1993); He et al., "Pollen Fertility 
Restoration by Nuclear Gene Fr in CMS Bean: Nuclear-Directed Alteration of a 
Mitochondrial Population," Genetics. 139:995-962 (1995)). The expression of 
various nuclear restorer genes has been reported to be either enhanced in reproductive 
30 tissue, as in the case of sunflower, or, as in the case of Petunia, expressed in both 
vegetative and reproductive tissues. Thus, different fertility restorer genes carry 
different promoters and nuclear expression regulatory elements which may confer 
very limited tissue-specific expression or very broad expression in the plant. 
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[0008] Reduction in the amount of the protein product of the CMS-encoding 

gene is the usual effect of these restorers whose target mitochondrial genes are 
known. These genes may possibly act by affecting the transcription or translation 
rate, the transcript or protein stability, processing, splicing, etc. Alleles of some 
5 restorer genes may up-regulate while others may down-regulate the expression of 
particular mitochondrial genes. Fertility restorer genes and their alleles or 
homologous counterparts in other species may thus be extremely valuable in 
engineering the expression of genes introduced into higher plant mitochondria. 
[0009] The cloning and sequencing of the restorer gene Rf2 in maize has been 

10 reported in Cui et ah, "The rf2 Nuclear Restorer Gene of Male-Sterile T-Cytoplasm 
Maize," Science, 272:1334-1336 (1996) and U.S. Patent No. 5,981,833 to Wise et al. 
This restorer gene acts in conjunction with a second required gene, Rfl, the gene that 
reduces the amount of the toxic protein, to restore fertility to plants carrying the maize 
CMS-T cytoplasm (Dewey et al., "Novel Recombinations in the Maize Mitochondrial 

15 Genome Produce a Unique Transcriptional Unit in the Texas Male-Sterile 

Cytoplasm," CelL 44:439-49 (1986); Dewey et al., "A Mitochondrial Pprotein 
Associated With Cytoplasmic Male Sterility in the T Cytoplasm of Maize," Proc. 
Natl. Acad. Sci. USA, 84:5374-78 (1987); Wise et al., "Urfl3-T of T Cytoplasm 
Maize Mitochondria Encodes a 13kD Polypeptide," Plant Mol. Biol., 9:121-26 

20 (1987)). Plants of genotype Rflrf2^ though sterile, have greatly reduced amounts of 
the URF13 protein. In contrast, sterile plants of genotype rflRf2 have abundant 
amounts of the URF13 protein. The Rf2 gene is, thus, unusual in that no effect on the 
expression of the maize T-CMS-associated protein, URF1 3, has been detected. The 
sequence of the gene bore out the absence of observable effect on mitochondrial gene 

25 expression; according to sequence analysis, Rf2 is apparently an aldehyde 

dehydrogenase (Liu et al., "Mitochondrial Aldehyde Dehydrogenase Activity is 
Required for Male Fertility in Maize," The Plant Cell 13:1063-1078 (2001)). It has 
been proposed that Rf2 acts by compensating for a metabolic defect caused by the low 
levels of the URF13 protein that remain despite the presence of Rfl, the gene that 

30 reduces the amount of the toxic protein (Dewey et al., "A Mitochondrial Protein 
Associated With Cytoplasmic Male Sterility in the T Cytoplasm of Maize," Proc. 
Natl. Acad. Sci. USA, 84:5374-78 (1987)) and also alters the l-urfl3 transcript 
profile (Kennell et al., "Influence of Nuclear Background on Transcription of a Maize 
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Mitochondrial Region Associated With Texas Male Sterile Cytoplasm," Mol. Gen. 
GeneU 210:399-406 (1987)). 

[0010] An abnormal recombinant mitochondrial gene in Petunia CMS lines 

(termed pcf) has been genetically correlated with CMS (Y oung et al., "A Fused 
Mitochondrial Gene Associated With Cytoplasmic Male Sterility is Developmentally 
Regulated," Cell, 50:41-49 (1987)). Because plant mitochondrial RNA is edited from 
C to U in some locations, the edited RNA sequence for the pcf gene has been 
determined, allowing the prediction of the pcf-encoded protein (Wintz et al., "A 
Termination Codon is Created by RNA Editing in the Petunia Mitochondrial atp9 
Gene Transcript," Curr. Genet., 19:61-64 (1990); Sutton et al., "Editing of Pre- 
mRNAs Can Occur Before cis- and fraws-Splicing in Petunia Mitochondria," Mol. 
Cell Biol., 1 1:4274-4277 (1991); Nivision et al. s "Sequencing, Processing, and 
Localization of the Petunia CMS-Associated Mitochondrial Protein," Plant J., 5:613- 
623 (1994); Hanson et al., "Mitochondrial Gene Organization and Expression in 
Petunia Male Fertile and Sterile Plants," J. Hered., 90:362-368 (1999)). Antibodies to 
synthetic peptide sequences have revealed the presence of a 19.5 kD PCF protein 
located in both the membrane and soluble fraction of mitochondria (Nivison et al., 
"Identification of a Mitochondrial Protein Associated With Cytoplasmic Male 
Sterility in Petunia." Plant Cell, 1:1121-30(1989)). The PCF protein is processed 
from a longer precursor protein and is entirely encoded by the urfS region of the pcf 
gene (Nivision et al., "Sequencing, Processing, and Localization of the Petunia CMS- 
Associated Mitochondrial Protein," Plant J., 5:613-623 (1994)). The PCF protein is 
strongly expressed in sporogenous cells of premeiotic petunia anthers in CMS lines, 
but undetectable in CMS-tf/lines (Conley et al., "Tissue-Specific Protein Expression 
in Plant Mitochondria," Plant Cell 6:85-91 (1994)). Abnormalities in Petunia pollen 
development are first observed in meiosis, and by the developmental stage where 
fertile plants are releasing pollen, CMS anthers are hollow shells (Conley et al., 
"Effects of Petunia Cytoplasmic Male Sterile (CMS) Cytoplasm on the Development 
of Sterile and Fertility-Restored Pparodii Anthers," Am. J. Bot.. 81:630-640 (1994)). 
It is evident that the pcf gene product is disrupting mitochondrial function, leading to 
death of the sporogenous cells, though the exact mechanism at the molecular level is 
not known. 
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[0011] In maize T, Petunia, rice, and Brassica Pol CMS systems, particular 

transcripts of CMS-associated genes have been reported to be altered in restored lines 
(Pruitt et ah, "Transcription of the Petunia Mitochondrial CMS-Associated pcflx>cus 
in Male Sterile and Fertility-Restored Lines," MoL Gen. Genet., 227:348-355 (1991); 
5 Dewey et al., "Novel Recombinations in the Maize Mitochondrial Genome Produce a 
Unique Transcriptional Unit in the Texas Male-Sterile Cytoplasm," Cell, 44:439-49 
(1986); Kennell et al., "Initiation and Processing of atp6, T-urfl3, and ORF221 
Transcripts From Mitochondria of T Cytoplasm Maize," Mol. Gen. Genet., 216:16-24 
(1989); Kennell et al., "Influence of Nuclear Background on Transcription of a Maize 

10 Mitochondrial Region Associated With Texas Male Sterile Cytoplasm," MoL Gen. 
Genet., 210:399-406 (1987); Singh et al., "Suppression of Cytoplasmic Male Sterility 
by Nuclear Genes Alters Expression of a Novel Mitochondrial Gene Region," Plant 
Cell, 3:1349-1362 (1991); Singh et al., "Nuclear Genes Associated With a Single 
Brassica CMS Restorer Locus Influence Transcripts of Three Different Mitochondrial 

15 Gene Regions," Genetics, 143:505-516 (1996); Wise et al., "Mitochondrial Transcript 
Processing and Restoration of Male Fertility in T-Cytoplasm Maize," J. Hered., 
90:380-385 (1999)). In Brassica, the presence of either one of two restorer genes 
results in monocistronic transcripts of atp6, instead of the dicistronic orf224/atp6 
transcripts found in CMS lines (Singh et aL, "Suppression of Cytoplasmic Male 

20 Sterility by Nuclear Genes Alters Expression of a Novel Mitochondrial Gene 

Region," Plant Cell 3:1349-1362 (1991)). A UG-rich sequence appears to be the 
target of the Brassica restorer alleles (Singh et al., "Nuclear Genes Associated With a 
Single Brassica CMS Restorer Locus Influence Transcripts of Three Different 
Mitochondrial Gene Regions," Genetics, 143:505-516 (1996)). In Petunia, pcf 

25 transcripts with 5 1 termini at -121 are specifically reduced in restored lines (Pruitt et 
al., "Transcription of the Petunia Mitochondrial CMS-Associated pc/Xocus in Male 
Sterile and Fertility-Restored Lines," Mol. Gen. Genet., 227:348-355 (1991)), while 
transcripts terminating at -266 and -522 remain at normal levels. In maize T 
cytoplasm, a sequence unlike either the Brassica restorer target or the Petunia -121 

30 transcript terminus is the putative recognition signal for the Rfl gene (Dill et al., "Rf8 
and Rf* Mediate Unique T-urfl 3 -Transcript Accumulation, Revealing a Conserved 
Motif Associated With RNA Processing and Restoration of Pollen Fertility in T- 
cytoplasm Maize," Genetics. 147:1367-1379 (1997)). 
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[0012] The steady-state amounts of the Petunia /?cf-encoded protein and the 

maize wr/73-encoded protein decrease greatly in restored lines compared to unrestored 
lines (Nivison et al., "Identification of a Mitochondrial Protein Associated With 
Cytoplasmic Male Sterility in Petunia," Plant Cell, 1:1 121-30 (1989); Dewey et aL, 

5 "Novel Recombinations in the Maize Mitochondrial Genome Produce a Unique 
Transcriptional Unit in the Texas Male-Sterile Cytoplasm," Cell, 44:439-49 (1986); 
Wise et al., "Urfl3-T of T Cytoplasm Maize Mitochondria Encodes a 13kD 
Polypeptide," Plant Mol. Biol.. 9:121-26 (1987)). Abundance of CMS-associated 
proteins is also reduced in sunflower and radish (Horn et al., "A Mitochondrial 16 

10 kDa Protein is Associated With Cytoplasmic Male Sterility in Sunflower," Plant Mol. 
Biol., 17:29-36 (1991); Laver et al., "Mitochondrial Genome Organization and 
Expression Associated With Cytoplasmic Male Sterility in Sunflower (Helianthus 
annuus)" Plant J., 1:185-193 (1991); Krishnasamy et aL, "Organ-Specific Reduction 
in the Abundance of a Mitochondrial Protein Accompanies Fertility Restoration in 

15 Cytoplasmic Male-Sterile Radish," Plant Mol. Biol., 26:935-946 (1994)). The 

mechanism behind the reduction in quantity of CMS-associated proteins in restored 
lines is not understood. For example, absence of transcripts that could potentially 
encode the PCF protein is not the explanation; only the shortest transcript is reduced 
in restored lines (Pruitt et al., "Transcription of the Petunia Mitochondrial CMS- 

20 Associated pc/Locus in Male Sterile and Fertility-Restored Lines/' Mol. Gen. Genet., 
227:348-355(1991)). 

[0013] In Petunia and in some other CMS/restorer systems, the abnormal gene 

is co-transcribed with known mitochondrial genes. One possible mechanism for CMS 
in Petunia and its restoration, which is also consistent with current data, is that the 
25 restorer gene not only results in decrease in the expression of PCF, but also improves 
the expression of the co-transcribed genes nad3 and rpsl2 in some way. For example, 
it remains possible that an RNA processing event results in little translation of PCF 
but enhanced production of NAD3 and RPS12 protein. 

[0014] In sum, with the exception of maize Rf2, in those systems where 

30 analysis has reached the molecular level, restorer genes have been found to affect the 
abundance of mitochondrial-encoded DNAs, RNAs, and proteins. 
[0015] Cytoplasmic male sterility/restorer systems have been proven to be an 

invaluable tool in the production of hybrid seeds. Despite their importance for both 
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the production of major crops such as rice and sunflower and the study of 
organelle/nuclear interactions in plants, none of the nuclear fertility-restorer genes 
that reduce the expression of aberrant mitochondrial proteins have been cloned. 
[0016] The present invention is directed to overcoming these deficiencies in 

5 the art. 

SUMMARY OF THE INVENTION 

[0017] The present invention relates to an isolated nucleic acid molecule 

which restores fertility to cytoplasmic male sterile plants and modifies expression of 

10 toxic mitochondria proteins by the plant. The nucleic acid molecule encodes a protein 
having an amino acid sequence of SEQ ID NOs: 2, 21, 23, 25, 27, 29, 31, 33, 35, 37, 
39, or 41 . Alternatively, the nucleic acid molecule encodes a protein containing a 
motif having an amino acid sequence corresponding to any of SEQ ID NOs: 3 to 18 or 
an amino acid sequence identified with a METAMEME software using the amino 

15 acid sequence of SEQ ID NO: 2 as input or an amino acid sequence identified as 

significantly similar to SEQ ID NO: 2 using a NCBI BLAST software (threshold= E 
less than or equal to 15) with SEQ ID NO: 2 as input. Alternatively, the nucleic acid 
molecule hybridizes to a nucleotide sequence of from nucleotide 1982 to 3760 of SEQ 
ID NO: 1, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ 

20 ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, SEQ 
ID NO: 36, SEQ ID NO: 38, or SEQ ID NO: 40 under stringent conditions of a 
hybridization buffer containing 20% formamide in 0.9M saline/0.09M SSC buffer at a 
temperature of 42°C. Alternatively, the nucleic acid molecule has a nucleotide 
sequence of from nucleotide 1982 to 3760 of SEQ ID NO: 1, SEQ ID NO: 19, SEQ 

25 ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ 
ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, or 
SEQ ID NO: 40. 

[0018] Another aspect of the present invention relates to a method of 

identifying a candidate plant suitable for breeding with a cytoplasmic male sterile 
30 plant. The method involves analyzing the candidate plant for the presence, in its 
genome, of the above nucleic acid molecule of the present invention. 



WO 03/057859 



PCT/US03/00887 



[0019] Yet another aspect of the present invention relates to a method of 

identifying a candidate gene restoring fertility in plants. The method involves 
analyzing the candidate gene for the presence of the above nucleic acid molecule in 
accordance with the present invention. 

5 [0020] The present invention also relates to a method of producing hybrid 

plant seed. The method first involves providing a cytoplasmic male sterile plant. 
Next, a second plant containing the above nucleic acid molecule in accordance with 
the present invention is provided. Finally, the cytoplasmic male sterile plant and the 
second plant are bred under conditions effective to produce hybrid progeny seed 

10 which yield fertile plants. 

[0021] Another aspect of the present invention relates to a method of 

producing plant seeds for an inbred line of plants. The method first involves 
providing a cytoplasmic male sterile plant. Next, a second plant containing the above 
nucleic acid molecule in accordance with the present invention is provided. Then, the 

15 cytoplasmic male sterile plant and the second plant are bred under conditions effective 
to produce hybrid progeny seed which yield fertile plants. Next, hybrid fertile plants 
are produced from the hybrid progeny seeds. Finally, the hybrid fertile plants and the 
second plant are backcrossed to produce seed which yiled inbred progeny plants. 
[0022] Yet another aspect of the present invention relates to a method of 

20 directing gene expression to plant mitochondria. The method involves transforming a 
plant with a chimeric nucleic acid molecule containing a transgene operatively linked 
to a promoter or a terminator from a plant gene which restores fertility to cytoplasmic 
male sterile plants and modifies expression of toxic mitochondria proteins by the plant 
under conditions effective to direct expression of the transgene in the mitochondria of 

25 the transformed plant. The promoter has a nucleotide sequence of from nucleotide 1 
to 1981 of SEQ ID NO: 1. The terminator has a nucleotide sequence of from 
nucleotide 3761 to 4593 of SEQ ID NO: 1. 

[0023] The present invention also relates to a promoter from a plant gene 

which restores fertility to cytoplasmic male sterile plants and modifies expression of 
30 toxic mitochondria proteins by the plant under conditions effective to direct 
expression of the transgene in the mitochondria of the transformed plant. The 
promoter has a nucleotide sequence of from nucleotide 1 to 1981 of SEQ ID NO: 1 . 
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[0024] Another aspect of the present invention relates to a terminator from a 

plant gene which restores fertility to cytoplasmic male sterile plants and modifies 
expression of toxic mitochondria proteins by the plant under conditions effective to 
direct expression of the transgene in the mitochondria of the transformed plant. The 
5 terminator has a nucleotide sequence of from nucleotide 3761 to 4593 of SEQ ID NO: 
1. 

[0025] Yet another aspect of the present invention relates to a nucleic acid 

construct. The nucleic acid construct includes: (i) a promoter or a terminator from a 
plant gene which restores fertility to cytoplasmic male sterile plants and modifies 

10 expression of toxic mitochondria proteins by the plant under conditions effective to 
direct expression of the transgene in the mitochondria of the transformed plant and (ii) 
a nucleic acid heterologous to and operatively coupled to the promoter or the 
terminator. The promoter has a nucleotide sequence of from nucleotide 1 to 1981 of 
SEQ ID NO: 1. The terminator has a nucleotide sequence of from nucleotide 3761 to 

15 4593 ofSEQIDNO: 1. 

[0026] The present invention also relates to a method of expressing a gene 

preferentially in roots of a plant. The method involves transforming a plant with a 
nucleic acid construct. The nucleic acid construct includes a promoter suitable for 
driving expression preferentially in roots having a nucleotide sequence of from 1 to 

20 1388 of SEQ ID NO: 44; a nucleic acid heterologous to the promoter, where the 

promoter is operatively coupled 5' to the nucleic acid to induce transcription of the 
nucleic acid; and a terminator having a nucleotide sequence of from nucleotide 3 168 
to 4016 of SEQ ID NO: 44, where the terminator is operably coupled 3' to the nucleic 
acid. 

25 [0027] Another aspect of the present invention relates to a method of altering 

plant floral morphology in ornamental plants. The method involves transforming an 
ornamental plant with the above nucleic acid molecule in accordance with the present 
invention. 

[0028] Another aspect of the present invention relates to a method of 

30 producing plants with a cytoplasmic male sterile plant restoration system. The 
method first involves transforming a first plant in its chloroplast genome with a 
nucleic acid which causes the plant to become male sterile. Next a second plant is 
transformed with the above nucleic acid molecule in accordance with the present 
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invention whose protein product is targeted to the chloroplast. Finally, the first and 
second plants are crossed to produce progeny plants possessing a cytoplasmic male 
sterile plant restoration system. 

[0029] Another aspect of the present invention relates to a method of 

5 producing plants with a cytoplasmic male sterile plant restoration system. The 

method first involves mutagenizing a first plant having a nucleic acid which encodes a 
protein. The protein has a motif having an amino acid sequence corresponding to any 
of SEQ ID NOs: 3 to 18 or an amino acid sequence identified with a METAMEME 
software using the amino acid sequence of SEQ ID NO: 2 as input or an amino acid 

10 sequence identified as significantly similar to SEQ ID NO: 2 using a NCBI BLAST 
software (threshold= E less than or equal to 1 5) with SEQ ID NO: 2 as input. Next, 
the mutagenized first plant is crossed with a wild-type plant having mitochondrial 
DNA polymorphisms compared to mitochondrial DNA in the mutagenized first plant 
to produce progeny plants. Finally, it is determined if the progeny plants are fertile, 

15 whereby fertile progeny plants can be used as a fertile maintainer line, where the 
mutagenized first plant, the fertile maintainer line, and a wild-type allele present in 
the first plant before mutagenesis comprises a new cytoplasmic male sterile plant 
restoration system. 

[0030] The present invention also relates to an isolated nucleic acid sequence 

20 corresponding to SEQ ID NO: 42 or SEQ ID NO: 44. 

[0031] The present invention identifies nucleic acid sequences which encode 

the gene for restoration of fertility to cytoplasmic male sterile plants. This gene 
modifies the expression of the mitochondrial genome and is the first such gene 
sequence that has been identified. In petunia, the gene may be transferred to lines 

25 lacking the gene in order to restore fertility. More importantly, the gene sequence has 
characteristics that can be used to identify comparable genes from economically 
important species. Thus, the gene and the sequence information may be used to 
develop hybrid seed production systems in economically important plants. 
Furthermore, the information may be used in crop improvement by controlling 

30 mitochondrial gene expression. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0032] Figures 1 A-D show that the i?/locus contains two tandem 

mitochondrially targeted PPR motif genes. Figure 1 A illustrates the genomic 
organization of the region containing the R/-PPR592 and R/-PPR591 genes. 
Duplicated blocks are indicated by similar shading. Arrows indicate the direction of 
transcription. 1 and 2 show locations of the primers used to amplify the rf-PPR592 
gene from a CMS plant. Figure IB shows a single onion epidermal cell expressing a 
known mitochondrially targeted green fluorescent protein (GFP) after DNA 
bombardment. Figure 1C shows a single onion epidermal cell transiently expressing 
44 N-terminal amino acids of Rf-PPR592 fused to GFP. Figure ID shows a 
comparison of PPR motifs found in Rf-PPR592 with the MEME-derived consensus 
from 1,303 PPR motifs. The 14 PPR repeats are sorted by decreasing statistical 
significance, with PPR 230-264 showing the highest match to the consensus motif 
that is generated by retaining only the amino acids that occur at least in 6 of the 
14 repeats. 

[0033] Figures 2A-B show the genetic structure of the rf-PPR592 gene. 

Figure 2A illustrates that a comparison of R/-PPR592 and rf-PPR592 reveals a size 
polymorphism. The first lane was loaded with the R/-PPR592 PCR amplicon 
obtained from a restorer line (RflRj), the adjacent lane was loaded with the rf-PPR592 
PCR amplicon obtained with the same primer pair from a CMS line {rflrf). Figure 2B 
illustrates that a comparison of R/-PPR592, Rf-PPR591, and rf-PPR592 reveals five 
similarity blocks. For each block, (I to V), the two blocks that exhibit the greatest 
similarity are shown with the same shading. Overall all three sequences are greater 
than 90% identical at the nucleotide level except in block V, where R/-PPR591 
exhibits only 23% identity to the other two genes. The locations of 47- and 49-nt 
deletions in Rf-PPR591 and 47- and 530-nt deletions in rf-PPR592 with respect to the 
Rf-PPR592 sequence in blocks I and II are shown as lines. 
[0034] Figures 3A-B show the expression pattern of rfPPR592 and Rf- 

PPR592. Figure 3 A depicts the examination of floral bud RNA for expression of rf- 
PPR592 and R/-PPR592. RT-PCR of floral bud RNA of a CMS plant (S) with 
primers specific to rf-PPR592, and RT-PCR of floral bud RNA of an RJRf 
(nontransgenic) fertile plant with primers specific for R/-PPR592 (R). DNA, positive 
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control for the amplification where the substrate is leaf DNA from a CMS plant; M, 
mass markers; 0, no template added, negative control- Figure 3B depicts the 
examination of different tissues for expression of rf-PPR592. RT-PCR of RNA from 
different tissues of a CMS plant with primers specific to rf-PPR592. DNA, M, and 
5 0 are same as in Figure 3 A. 

[0035] Figures 4A-D illustrate the restoration of fertility to CMS Petunia lines 

by transformation with a 4.6-kb genomic sequence carrying R/-PPR592. Figure 4A 
shows the flower of P. parodii CMS line 3688. Figure 4B shows the regenerant 
carrying R/-PPR592. Figure 4C shows the P. hybrida CMS line 2423. Figure 4D 

1 0 shows the regenerant carrying R/-PPR592. 

[00361 Figures 5 A-B illustrate the cosegregation of the R/-PPR592 transgene, 

restoration of fertility, and reduction of PCF. Figure 5 A shows the DNA blot 
hybridized with an npt II transgene-specific probe. Lane 1, P. parodii CMS line 
3688; lanes 2-and 3, sterile Ti progeny of transformed P. parodii^ lanes 4-9, fertile T\ 

15 progeny. Figure 5B shows the immunoblot of floral bud proteins probed with anti- 
PCF antibody. Lanes are as in Figure 5A. 

[0037] Figures 6A-B illustrate abnormal flowers on plants obtained by 

introducing 7?/^PPR592 into a CMS background. Figure 6A shows a petaloid flower 
on a plant carrying a recombination event near the i?/locus, affecting the region 5' to 
20 7?/ : PPR592. Figure 6B shows an abnormal flower on a plant carrying the CMS 
cytoplasm and the 4.5 kb i?/-PPR592 transgene. 

[0038] Figures 7 A-B show methods for creating a new CMS/restorer system. 

[0039] Figure 8 shows a two-line method for hybrid rice production, using an 

engineered inducible restorer gene. 

25 

DETAILED DESCRIPTION OF THE INVENTION 

[0040] The present invention relates to an isolated nucleic acid molecule 

which restores fertility to cytoplasmic male sterile plants and modifies expression of 
toxic mitochondria proteins by the plant. 
30 [0041] One form of the nucleic acid molecule of the present invention is a 

nucleotide sequence of from nucleotide 1982 to 3760 of SEQ ID NO: 1, identified 
herein as R/-PPR592, as follows: 
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1 ATATATATATACAAACTGATTTTTTCTGTCTATTTGCACAGTGTTATATTTACATACCCTTGAAAAAGGGTAGCTCCGCT 
81 AATAATGTTATCTTTACAAAAAATAACAATAATTTTTTTACATAATATATACAAAACTCATTGTTATGTATTGTAAATAT 
161 GATAAAAATATTGTTATTTTTTGTAATATAGCTATTAGGTAGTCATGTGGTGTAATATTTCCTAAAAATATTTACCTGAG 

2 41 TCGGCCATTTGGCTA7^AAATATTTTATTTTATAGTCGCATATACTCCAAGCTTGTATATCCCATAGCGACAGTATACCTA 
5 321 TACGATATCTTCTATTATTTACCTTTTTAGTATTCGTATACCCCCAATAGTATACAAGTTTGACACCGCAATAGTGTACC 

4 01 CCAATGTTGTGGTTGTGTGGCTATAAAATGTTTAAACAAAATTTGAGTGATGGTAGTGTAATTTTTTAGTGTAAGCTGGG 

4 81 T AGT TTT AAAAAC AT T CT T T T T GAAAAT TG T AGT T C AAGT C AT AGT AC AAAAAAC T GAAAT ATT T AT AT G TT T C T T GAT T 

5 61 TTGGCTGGTCTTCTAAAAATTTTGAAATGCTGGCTAGTTTTCATTTAGCGAGGGGCAAATAGACTACATGGCCAAATTTT 
641 TACGTTAAAAGAAGTGTTGTCTGGGAAAGTATTCGAAAAGATTGTACGGACAAGTGTTGCCTAGACAACACGTCAAATTA 

10 721 T GT AGAAAAAT GT AGG AAG AAAT TC AAAAGCAAAT AT T GC T T AAGC AAAAGG C AGT CAAAG ACAAT GC T G CCT T AGGG AG 

801 T GAG AAAT G GGC AT C ACT AT AAGAT T GT AT T T C CAT T CGAT AT T TAT T C AT T AT AAAC T T AAGG AAAAGT GC AAGG AAAA 

881 GCCACTTTTTGGCTTGCCTTTACCGTTGAAGCTACTTTCAAAGAAAAAGAGCTAGTTTTTAGCTTTTTTGGAACTTTAAT 

961 CATTGTGGGCCGAACTTCAGACCTTGTGGGCCGAACTTCATACATTCACAAGTAAAAAATTAGCTCACAGGCCACTTTTA 

1041 CCACTAGTATTTGGTTTGAAGTCATTTTTTTATTGGTTTTACATGAGAGACCACTTTTTGGAACTTCAATCTTTGTGCGC 

15 1121 TTGAACTTCATGCCTAAGTTATTAAGTTCAACTTCAATCCGTAAGGGCTGAATTTTTAGGCATAGATGCGTAAACTTCAA 

1201 CCTTGTGGACTGAAGTTGAACTTCGCCCCTTATGGTGGCCTGAAGTTGAACTTCAATCCTTGTGGGCTGAACTTGTGTGA 

1281 AGTTCAACCCACAAGGATTAAAGTTTCAAAAAATGACCTCTCAAGCAAAATCTGCAAAAAAAAGTGGTCTCTCATGCACT 

13 61 TTTACCCATTCGC7^AAGTAGGCTGAAGTTCAGCCCAC7\ATTATTCAAGTTCCAAAAAATTTCACAATATATACCTCCTTA 

14 41 TCTCGGTTATGATCTTTTGTATGATTTAGCAAAATGGACGGGGAAAGTGCACGAAAGACCACTTTTGCCATTGGTCTTTG 
20 1521 GGTACAGGCCACTAATACCAAAATATTTAGTTTGTGGCTACTTTTGCTTAAAGAGATAGAACTTCAGTCCAGAGGCCGGA 

1601 TTGAAGTTCAGTCCTTAAAGATTGAACTTCGATCCAGTGCCATATGGACTGAAGTTCAGTCAAGTCCTTAAGATGGAACT 

1681 TCAGTCCAGAGCCATATGGACTGAAGTTCAATCCTTAAAGATAGAACTTCAGTCCAGGGGCCGTATGGACTGAAGTTCAG 

17 61 TCAATTATCAGAACTTAAGTCAGTATTTATTTAGTAAAGGCCCAAAAGTGGTTAGTATAAGACCAATAAAAATAGAGGCC 

18 41 TAAAACTAAATAACAGTGTTAAAAGTGGCTGATGGACGAAATTTCTACAAAATGGACTCGAGGTAGCAATTCAACTTCAA 
25 1921 CCTATGGTGTCATAGTCGTACAATTCTTCCAATCACCCCTACTAAGTGAAGTGAAGCGAAGATGATGAGAATTGCAGTGC 

2001 GTTACTGTCTCAATGGTAATCCCTTTTTCTCATTCTTTGCTTATTCAATTGCACCCCGACATTATTCTACCAATACATGT 

2081 TCCATTTCAGTTAAAGGGAATTTTGGGGTTTCTAATGAATTTGAGAATGTTAAGTGTTTAGATGATGCTTTCAGTTTGTT 

2161 CCGTCAAATGGTTACAACTAAGCCTCTTCCTTCTGCTGTCTCTTTCTCTAAATTGTTGAAAGCTTTGGTACATATGAAGC 

2241 ATTACTCTTCTGTTGTTTCTATTTTTCGAGAAATCCACAAATTACGTATTCCTGTTGATGCTTTCGCCTTGAGCACTGTG 

30 2 321 GTTAACAGTTGTTGCCTTATGCATCGTACCGATCTCGGATTTTCTGTATTAGCCATTCACTTCAAGAAAGGTATTCCATA 

24 01 TAATGAAGTCACCTTTACTACCTTAATAAGGGGACTTTTTGCTGAAAATAAGGTCAAAGATGCTGTTCATTTGTTCAAAA 

2 4 81 AGTTGGTGAGGGAGAATATATGTGAGCCTGATGAAGTCATGTATGGGACGGTCATGGATGGGCTTTGCAAGAAGGGCCAT 

2561 AC T C AAAAAGCT TTT GAT T T GC TCC GGT T AAT GG AAC AAGG AAT T AC T AAGC C C GAT AC AT G C AT CT AC AAC AT T GT TAT 

2 641 CGAT GCCTTTTGCAAAGATGGGATGCTAGATGGTGCTACCAGCCTTTTGAACGAGATGAAACAAAAAAACATT CCT CCAG 

35 2721 ACATTATTACATATACCTCATTGATCGATGGTTTGGGTAAGTTAAGTCAGTGGGAAAAGGTTAGGACTTTGTTCCTTGAG 

2801 ATGATACATCTTAATATTTATCCAGATGTGTGCACCTTCAACTCCGTCATTGATGGACTATGCAAAGAGGGGAAAGTTGA 

2881 AG AT GC CG AGG AAAT AAT G AC AT AC AT G AT CG AAAAAGG T GT AG AAC C T AAT GAG AT AAC C T AC AAT G T G GT AAT G GAT G 

2 961 GAT AT T GC TTGCGTGG T C AAAT GG G T AG AG CG AG GAG AAT T T T T GAT T C CAT GAT AGAT AAG GGC AT T G AGCC T GAT AT C 
3041 AT TAG C T AT ACC G C AC T AAT AAAT G G AT AC GT CG AG AAAAAG AAAAT GG AT AAG GC CAT GCAATT G TTT C GT GAAAT T T C 

40 3121 TCAAAATGGATTGAAACCTAGTATTGTTACCTGCAGTGTTCTCTTGCGTGGTCTTTTTGAAGTTGGAAGAACTGAATGTG 

3201 CAAAAATATTCTTTGATGAGATGCAAGCTGCGGGGCACATACCTAATTTATACACTCATTGCACTTTGCTTGGTGGTTAT 

3281 TTTAAGAATGGACTTGTTGAAGAGGCTATGTCACACTTCCATAAGTTGGAAAGGAGGAGAGAAGATACAAATATTCAAAT 

3361 TTACACGGCTGTCATTAATGGATTGTGCAAAAATGGTAAGCTCGACAAAGCTCATGCTACGTTTGAGAAGCTTCCCTTGA 

34 41 T AGGC T T AC AT CCT G AT GT GAT AAC AT AC ACT GC AAT G AT T AGT GG AT AT T G T C AAG AAG GG T TGT T AG ATG AAGC T AAA 

45 3521 GATATGCTAAGGAAAATGGAGGACAATGGTTGTTTGCCAGACAACCGAACATACAATGTTATTGTGCGGGGATTTTTCAG 

3 601 AAGC AG T AAAGT T AG T GAAAT G AAG GCTTTTCT G AAGG AAAT AG C T GGG AAG AG CT TC T C AT T TG AGG CAGC TACT G TAG 
3 681 AGTTATTGATGGATATTATAGCAGAGGATCCTTCTTTGCTTAACATGATTCCAGAATTTCACCGGGATAATAAGAAGTGA 
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37 61 AT AACT T T T GC ACCTG T x T T T T T T G ACG AT AT C ACC AT TAT T C T GC T AT T T C CT T T CAT CT T AG C AAAAG AAAT T GC AT C 

3841 CAGTGGAATTGCGGAAGCTGAAAAAATGGCAAGAAGAACATTGCTTAAGCTTTCCTGGCAAGCTTATATCGGAGGGACAT 

3921 CATTTTGGTTGTTTTGGCTCTCTTCTTTATCTTGGAAATCAAATGTTCTGCGCTCTTAATATCAGAAACAATGTGAACTC 

4001 C CAT AT AT GT AC G AGT T AT AAGT T T C GG AAT AT G AT TT C AAT GGT T T C AG TAT T CT AT T T T T GAT AT G G AAT T AAT T T T T 

5 4081 GAGCGACCCAGTGTTGACCATTGCCTACCTTCGGTTATTATATGATTGAAATTCCCTCCAATCTCCAATACTCACTTCAT 

4161 TTTGTCTTGTTGAATTTTTCAATTTTTCTTTTTCTGTTACGATTGTCATTTTCACCGCCTTGAGTATCCATCAGGTTCCA 

4241 GTTGAAAAAGAATCATTTTTTGCCATGACCATCATGCTTTCTGAGTGCAAGATCAAGAGAGGTACTTTTCTCTCTAAGAA 

4321 CCTCTTGGTTTTTTAAGTGTTCTGGGTTCTTTCAGTACTTTTAAGCTATTTTCTAATCCTTTGAAGAGATTCATACATAT 

4401 CTGTGCATGTGTTTGTTTCTTTTTTTCGGGTGATACTTTGTTTTATAGCTAAGGATTGAAAAGGTAATTTTCATTTTCAT 

10 4481 TAGCAATAGATATGAAACAGCTTTGTAAGGACTCTGGAGTCTCCTAAAAATTTTGGCTATGCAAATAGCCTATTGCATCA 

4561 ATTTGTCGTTGAAATCCATGTATCATAAAAAAA 



R/-PPR592, isolated from Petunia has an open reading frame ("ORF") of 1779 bp, 
extending between nucleotides 1982-3760. 
15 [0042] The nucleic acid molecule of the present invention which has the 

nucleotide sequence of from nucleotide 1982 to 3760 of SEQ ID NO: 1 encodes a 
protein or polypeptide having a deduced amino acid sequence of SEQ ID NO: 2, as 
follows: 



20 MMRIAVRYCLNGNPFFSFFAYSIAPRHYSTNTCSISVKGNFGVSNEFENVKCLDDAFSLFRQMVTTKPLPSAVSFS 
KLLKALVHMKHYSSWSIFREIHKLRIPVDAFALSTWNSCCLMHRTDLGFSVLAIHFKKGIPYNEVTFTTLIRGL 
FAENKVKDAVHLFKKLVRENICEPDEVMYGTVMDGLCKKGHTQECAFDLLRLMEQGITKPDTCIYNIVIDAFCKDGM 
LDGATSLLNEMKQKNIPPDIITYTSLIDGLGKLSQWEKVRTLFLEMIHLNIYPDVCTFNSVIDGLCKEGKVEDAEE 
IMTYMIEKGVEPNEITYNVVMDGYCLRGQMGRARRIFDSMIDKGIEPDIISYTALINGYVEKKKMDKAMQLFREIS 

25 QNGLKPSIVTCSVLLRGLFEVGRTECAKIFFDEMQAAGHIPNLYTHCTLLGGYFKNGLVEEAMSHFHKLERRREDT 
NIQIYTAVINGLCKNGKLDKAHATFEKLPLIGLHPDVITYTAMISGYCQEGLLDEAKDMLRKMEDNGCLPDNRTYN 
V I VRG FFR S S KV S EMKAFLKE I AGK S F S FE AAT VE L LM D 1 1 AE DPSLLNMIPE FHR DNKK 



[0043] As shown in Figure ID, most of the predicted mature protein (87%) of 

30 Rf-PPR592 consists of 14 pentatricopeptide repeat motifs (PPRs). These repeats 
extend from the amino acid in position 54 to the amino acid in position 544 and are 
organized in two sets of tandem repeats, one set containing 3 PPRs from amino acid 
54 to amino acid 158, the other set containing 1 1 PPRs from amino acid 160 to amino 
acid 544. Thus, another suitable nucleic acid molecule in accordance with the present 
35 invention encodes a protein containing a motif having an amino acid sequence 

corresponding to any of the PPR motifs (SEQ ID NOs: 3 to 18), where SEQ ID NO: 3 
is as follows: 
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E E A . . Li Y . . M . . . G . . 

where SEQ ID NO: 4 is as follows: 

5 D.A...F..M...G.. 

where SEQ ID NO: 5 is as follows: 

DDAFSLFRQMVTTKPL 

10 

where SEQ ID NO: 6 is as follows: 

SSVVSIFREIHKLRIP 

15 where SEQ ID NO: 7 is as follows: 

DLGFSVLAIHFKKGIP 

where SEQ ID NO: 8 is as follows: 

20 

DAVHLFKKLVRENICE 

where SEQ ID NO: 9 is as follows: 



PN. . TYNALINAYAK . G . . 



PD. . T Y . . L I . G L C K . G . - 



P SAVS FSKLLKALVHMKHY 



VDAFALSTVVNS CCLMHRT 



YNEVTFTTLI RGLFAENKV 



PDEVMYGTVMDGLCK KGHT 



DTCIYNIVIDAFCKDGML 



DIITYTSLIDGLGKLSQW 



DVCT FNSVIDGLCKEGKV 



NEITYNVVMDGYCLRGQM 



25 QKAFDL.LRLMEQGITKP 

where SEQ ID NO: 10 is as follows: 

DGATSLLNEMKQKNIPP 

30 

where SEQ ID NO: 11 is as follows: 

EKVRTLFLEMIHLNIYP 

35 where SEQ ID NO: 12 is as follows: 

EDAEE IMTYMIEKGVEP 
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where SEQ ID NO: 13 is as follows: 

GRARR IFDSMIDKGIEPDI ISYTAIjINGYVEKKKM 

5 

where SEQ ID NO: 14 is as follows: 

DKAMQLFRE I SQNGLKPS IVTCSVLLRGLFEVGRT 

10 where SEQ ID NO: 15 is as follows: 

ECAKI FFDEMQAAGHI PNLYTHCTLLGGYFKNGLV 

where SEQ ID NO: 16 is as follows: 

15 

EEAMSHFHKLERRREDTNIQIYTAVINGLCKNGKL 

where SEQ ID NO: 17 is as follows: 

20 DKAHATFE KLPLI GLHPDVITYTAMISGYCQEGLL 

and where SEQ ID NO: 18 is as follows: 

DEAKDMLRKMEDNGCLPDNRTYNVIVRGFFRSSKV 

25 

A PPR motif-containing gene can be identified if it contains the consensus sequence 
(SEQ ID NOs: 3 or 4) or if it is found with a MEME software (Bailey et aL, "Fitting a 
Mixture Model by Expectation Maximization to Discover Motifs in Biopolymers," 
Proceedings of the Second International Conference on Intelligent Systems for 

30 Molecular Biology, pp. 28-36, AAAI Press, Menlo Park, California (1994), which is 
hereby incorporated by reference in its entirety). To find whether a protein has a PPR 
motif with the MEME software, the parameters for motif searching should be set as 
minimum width=35, maximum width=35. MEME (Multiple Em for Motif 
Elicitation) is a software tool for discovering motifs in a group of related DNA or 

35 protein sequences (Bailey et ah, "Fitting a Mixture Model by Expectation 

Maximization to Discover Motifs in Biopolymers," Proceedings of the Second 



WO 03/057859 PCT/US03/00887 

- 18- 

International Conference on Intelligent Systems for Molecular Biology, pp. 28-36, 
AAAI Press, Menlo Park, California (1994), which is hereby incorporated by 
reference in its entirety). MEME takes as input a group of DNA or protein sequences 
(the "training set") and outputs as many motifs as requested. MEME uses statistical 
5 modeling techniques to automatically choose the best width, number of occurrences, 
and description for each motif. MEME represents motifs as position-dependent letter- 
probability matrices which describe the probability of each possible letter at each 
position in the pattern. Individual MEME motifs do not contain gaps. Patterns with 
variable-length gaps are split by MEME into two or more separate motifs. 

10 [0044] Another suitable nucleic acid molecule for the present invention is a 

nucleic acid molecule which encodes a protein having an amino acid sequence 
identified with a METAMEME software using the amino acid sequence of SEQ ID 
NO: 2 as input. Meta-MEME is a software toolkit for building and using motif-based 
hidden Markov models of DNA and proteins. The input to Meta-MEME is a set of 

15 similar protein sequences, as well as a set of motif models discovered by MEME. 

Meta-MEME combines these models into a single, motif-based hidden Markov model 
and uses this model to produce a multiple alignment of the original set of sequences 
and to search a sequence database for homologs (Grundy et al., "Meta-MEME: Motif- 
based Hidden Markov Models of Biological Sequences," Computer Applications in 

20 the Biosciences, 13(4):397-406 (1997), which is hereby incorporated by reference in 
its entirety). 

[0045] Also suitable for the present invention is a nucleic acid molecule which 

encodes a protein having an amino acid sequence identified as significantly similar to 
SEQ ID NO: 2 using a NCBI BLAST software (threshold= E less than or equal to 15) 
25 with SEQ ID NO: 2 as input for comparison (Fulton et al., "Identification, Analysis, 
and Utilization of Conserved Ortholog Set Markers for Comparative Genomics in 
Higher Plants," Plant Cell 14:1457-1467 (2002), which is hereby incorporated by 
reference in its entirety). 

[0046] Also suitable in the present invention is a nucleic acid molecule which 

30 has a nucleotide sequence of SEQ ID NO: 1 9, an 8.5 kb fragment containing Rf- 
PPR592 capable of transforming cytoplasmic male sterile plants, as follows: 
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G GAT C C AAAAT T T C AC T AAAG G T T AAACG C G AGG AT AC T G AAGT T GG AG AG C AAT GT G G T AT C T T GGT G CAT GG AC 
GGAGTCATGGGGGTATAGTTGCTGGCTATTTAGCTAACTGGAGAACTGTTGTGAGGAATTATTTAAAGAATGCTAC 
TTTCTCGTCCACATAAACATGTCCAAATATTTTCTACTTGATAGAGAGTTCAAGGAAATAGTGTGGATTTCTTCCC 
AAAC AC AAAC G ATT T G AG AAAACT G AAGT G AAGG C T G AAG AG AAAACT AAAG AG AAC T GG AAG C T AAG AAAC AC AG 
5 AAGC AC AAACCT AT AAACATAGAC ACTGG C AT GT T G C AG AAAAT T T T AAC T T T CG AT T CT CC AGT AG AAAGACAC A 
AATACATCAGTAAATTTTCTTTAGGCTCAAGCAAGGATACATCTTGGTAGAATTTGCATTATACCAACATAATAAG 
CTCAAAA7\AATAACTAAGCTGCAACTAGCTACTTGGTCCGAGAAGCTTTTGCTATCAGGAAGTTCCACAGTTCCAA 
AACC AAG GT AG ACT AT C AAG CT AG T T C CC AC C AG T CAT T T T C T TAG AC T T T G C T C T C AC G AT AAACT AAG AT CAT T 
TTTTTATGATACATGGATTTCAACGACAAATTGATGCAATAGGCTATTTGCATAGCCAAAATTTTTAGGAGACTCC 

10 AGAGTCCTTACAAAGCTGTTTCATATCTATTGCTAATGAAAATGAAAATTACCTTTTCAATCCTTAGCTATAAAAC 
AAAG TAT C AC C CG AAAAAAAG AAAC AAAC AC AT G C AC AG AT AT G TAT G AAT C T C T T CAAAGGAT TAG AAAAT AG C T 
TAAAAGTACTGAAAGAACCCAGAACACTTAAAAAACCAAGAGGTTCTTAGAGAGAAAAGTACCTCTCTTGATCTTG 
CACTCAGAAAGCATGATGGTCATGGCAAAAAATGATTCTTTTTCAACTGGAACCTGATGGATACTCAAGGCGGTGA 
AAAT G AC AAT C GT AAC AG AAAAAG AAAAAT T G AAAAAT T C AAC AAG AC AAAAT G AAGT G AGT AT T GGAG ATT GG AG 

15 GGAATTTCAATCATATAATAACCGAAGGTAGGCAATGGTCAACACTGGGTCGCTCAAAAATTAATTCCATATCAAA 
AAT AG AAT ACT G AAAC CAT T G AAAT CAT AT T C C G AAAC T TAT AAC T C G T AC AT AT AT G GG AG T T C ACAT T G T T T CT 
GAT AT T AAG AG C G C AG AAC AT T T GAT T T C C AAG AT AAAG AAG AG AGC CAAAAC AACC AAAAT GAT GTCCCTCC GAT 
ATAAGCTTGCCAGGAAAGCTTAAGCAATGTTCTTCTTGCCATTTTTTCAGCTTCCGCAATTCCACTGGATGCAATT 
T C TT T T G CT AAG AT GAAAGG AAAT AG C AG AAT AAT GGT GAT AT CGT C AAAAAAAAC AG G T G C AAAAG T TAT T C AC T 

20 T C T TAT TAT C C CG GT G AAAT T C T G GAAT C AT GT T AAG C AAAG AAGG AT C C T C T G C TAT AAT AT C CAT C AAT AACT C 
TACAGTAGCTGCCTCAAATGAGAAGCTCTTCCCAGCTATTTCCTTCAGAAAAGCCTTCATTTCACTAACTTTACTG 
CTTCTGAAAAATCCCCGCACAATAACATTGTATGTTCGGTTGTCTGGCAAACAACCATTGTCCTCCATTTTCCTTA 
GCATATCTTTAGCTTCATCTAACAACCCTTCTTGACAATATCCACTAATCATTGCAGTGTATGTTATCACATCAGG 
ATGTAAGCCTATCAAGGGAAGCTTCTCAAACGTAGCATGAGCTTTGTCGAGCTTACCATTTTTGCACAATCCATTA 

25 ATGACAGCCGTGTAAATTTGAATATTTGTATCTTCTCTCCTCCTTTCCAACTTATGGAAGTGTGACATAGCCTCTT 
C AAC AAGT C C AT T C T T AAAAT AAC C AC C AAG C AAAGT GC AAT G AG TG TAT AAAT T AGGT AT G T GC C C C G C AG C T T G 
CATCTCATCAAAGAATATTTTTGCACATTCAGTTCTTCCAACTTCAAAAAGACCACGCAAGAGAACACTGCAGGTA 
ACAATACTAGGTTTCAATCCATTTTGAGAAATTTCACGAAACAATTGCATGGCCTTATCCATTTTCTTTTTCTCGA 
CGTATCCATTTATTAGTGCGGTATAGCTAATGATATCAGGCTCAATGCCCTTATCTATCATGGAATCAAAAATTCT 

30 CCTCGCTCTACCCATTTGACCACGCAAGCAATATCCATCCATTACCACATTGTAGGTTATCTCATTAGGTTCTACA 
CCTTTTTCGATCATGTATGTCATTATTTCCTCGGCATCTTCAACTTTCCCCTCTTTGCATAGTCCATCAATGACGG 
AGT T G AAGGT G C AC AC AT C T GG AT AAAT AT T AAG ATG TAT CAT C T C AAGG AAC AAAGT C C T AAC C T T T T C C C AC T G 
ACTTAACTTACCCAAACCATCGATCAATGAGGTATATGTAATAATGTCTGGAGGAATGTTTTTTTGTTTCATCTCG 
T T C AAAAGG C T GGT AG C AC CAT CT AG CAT C C CAT C TT T G C AAAAG GCAT C GAT AAC AAT G T T G TAG AT G C AT G T AT 

35 CGGGCTTAGTAATTCCTTGTTCCATTAACCGGAGCAAATCAAAAGCTTTTTGAGTATGGCCCTTCTTGCAAAGCCC 
ATCCATGACCGTCCCATACATGACTTCATCAGGCTCACATATATTCTCCCTCACCAACTTTTTGAACAAATGAACA 
GCATCTTTGACCTTATTTTCAGCAAAAAGTCCCCTTATTAAGGTAGTAAAGGTGACTTCATTATATGGAATACCTT 
TCTTGAAGTGAATGGCTAATACAGAAAATCCGAGATCGGTACGATGCATAAGGCAACAACTGTTAACCACAGTGCT 
CAAGGCGAAAGCATCAACAGGAATACGTAATTTGTGGATTTCTCGAAAAATAGAAACAACAGAAGAGTAATGCTTC 

40 AT AT GT AC C AAAG C T T T C AAC AAT T TAG AG AAAG AG AC AG C AG AAGG AAG AGG C T TAG T T GT AAC CAT T T G AC G G A 
ACAAACTGAAAGCATCATCTAAACACTTAACATTCTCAAATTCATTAGAAACCCCAAAATTCCCTTTAACTGAAAT 
GGAACATGTATTGGTAGAATAATGTCGGGGTGCAATTGAATAAGCAAAGAATGAGAAAAAGGGATTACCATTGAGA 
CAGTAACGCACTGCAATTCTCATCATCTTCGCTTCACTTCACTTAGTAGGGGTGATTGGAAGAATTGTACGACTAT 
GACACCATAGGTTGAAGTTGAATTGCTACCTCGAGTCCATTTTGTAGAAATTTCGTCCATCAGCCACTTTTAACAC 
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TGTTATTTAGTTTTAGGCCTCTATTTTTATTGGTCTTATACTAACCACTTTTGGGCCTTTACTAAATAAATACTGA 
CTTAAGTTCTGATAATTGACTGAACTTCAGTCCATACGGCCCCTGGACTGAAGTTCTATCTTTAAGGATTGAACTT 
CAGTCCATATGGCTCTGGACTGAAGTTCCATCTTAAGGACTTGACTGAACTTCAGTCCATATGGCACTGGATCGAA 
G T T C AAT CT T T AAGG AC T G AAC T T C AAT C C GG C C T C T G G ACT G AAG T T CT AT C T CT T T AAG C AAAAG T AG C C AC AA 
5 ACTAAATATTTTGGTATTAGTGGCCTGTACCCAAAGACCAATGGCAAAAGTGGTCTTTCGTGCACTTTCCCCGTCC 
AT T T T G C T AAAT CAT AC AAAAG AT CAT AAC CG AG AT AAGG AG G TAT AT AT T GT G AAAT T T T T T G G AAC T T G AAT AA 
TTGTGGGCTGAACTTCAGCCTACTTTGCGAATGGGTAAAAGTGCATGAGAGACCACTTTTTTTTGCAGATTTTGCT 
TGAGAGGTCATTTTTTGAAACTTTAATCCTTGTGGGTTGAACTTCACACAAGTTCAGCCCACAAGGATTGAAGTTC 
AACTTCAGGCCACCATAAGGGGCGAAGTTCAACTTCAGTCCACAAGGTTGAAGTTTACGCATCTATGCCTAAAAAT 

10 T C AG C C CT T AC G GAT T G AAGT T G AAC T T AAT AAC T T AGGC AT GAAG TT C AAG CG C AC AAAG AT T G AAG T T C C AAAA 
AGTGGTCTCT CAT G T AAAAC C AAT AAAAAAAT G AC T T C AAAC C AAAT AC TAG T G GT AAAAGT G G C C T G T GAG C T AA 
TTTTTTACTTGTGAATGTATGAAGTTCGGCCCACAAGGTCTGAAGTTCGGCCCACAATGATTAAAGTTCCAAAAAA 
GCTAAAAACTAGCTCTTTTTCTTTGAAAGTAGCTTCAACGGTAAAGGCAAGCCAAAAAGTGGCTTTTCCTTGCACT 
TTTCCTTAAGTTTATAATGAATAAATATCGAATGGAAATACAATCTTATAGTGATGCCCATTTCTCACTCCCTAAG 

15 GCAGCATTGTCTTTGACTGCCTTTTGCTTAAGCAATATTTGCTTTTGAATTTCTTCCTACATTTTTCTACATAATT 
TGACGTGTTGTCTAGGCAACACTTGTCCGTACAATCTTTTCGAATACTTTCCCAGACAACACTTCTTTTAACGTAA 
AAATTTGGCCATGTAGTCTATTTGCCCCTCGCTAAATGAAAACTAGCCAGCATTTCAAAATTTTTAGAAGACCAGC 
CAAAATCAAGAAACATATAAATATTTCAGTTTTTTGTACTATGACTTGAACTACAATTTTCAAAAAGAATGTTTTT 
AAAACT ACCCAGCTTACACTAAAAAAT T AC ACT AC CAT CACTCAAATT T TGT TT AAAC AT T T T AT AGC C ACAC AAC 

20 CACAACATTGGGGTACACTATTGCGGTGTCAAACTTGTATACTATTGGGGGTATACGAATACTAAAAAGGTAAATA 
ATAGAAGATATCGTATAGGTATACTGTCGCTATGGGATATACAAGCTTGGAGTATATGCGACTATAAAATAAAATA 
T T T T T AGC C AAAT GGC C G ACT C AG G T AAAT AT T T T TAG G AAAT AT T AC AC C ACAT G ACT AC C T AAT AG C TAT ATT A 
C AAAAAAT AAC AAT AT T T T TAT CAT AT T T AC AAT AC AT AAC AAT G AGT TT TGT AT AT ATT AT G T AAAAAAAT TAT T 
GTTATTTTTTGTAAAGATAACATTATTAGCGGAGCTACCCTTTTTCAAGGGTATGTAAATATAACACTGTGCAAAT 

25 AG AC AG AAAAAAT C AG T T T GT AT AT AT AT AT C AG AT AT T GAT TCCCCCTT CAT t T T T T CGT AT GT T T AC T T T T T T A 
TAT T TAT AT AT C C C T T AGT AAAAAT AC TGGCTCCGC C AC T GC C AGT AAGG TAG T AT T AGT T TGCGTCGC T C AAT AA 
AGTAACATCTATCGTTTATTTTTCATCAACATTAAAAAGGAAGATTCACTATCCACATAGGCATCATCATTATCAA 
AG AAT AT C AGTT C AT AC AT T GT AT AT AT AT AAC T TT CT C AAAT AAACT AACTT T AAAATG AAGT AC AT T AAAAAG G 
AAG AT T C AC TAT C C T T T T AAT AT T T C GT AT AT T T AC T TAT T TAT AT T T T GAT AC TC C T T AG T AAAAAT AC T G G CT C 

30 CGCCACTACC AGT GAT GT AAT ATT AATT CG CGTC C CT CACTAAAGTAAC AC CT ATAAT T TAAT TTTCAT GAAGTC A 
GAGTTAGCATTGGAAAGGGATATAAGCACATGCATTGTGTATATATATATAACTTGCTCAAATAAACTAACTTAAA 
AAT G AAAT TTTACTTTTCC T AGT AC AAT G AA C T AT G CAT C AAT G C GT AAT TAG T T G AG GT C G G C TAT AT G AAT AT G 
T TAT TAAT T T GAAAGC AAAAC AT AAT AACT GAT AG AAG AAT T T T G C AC C T AAAAAT T G AAC T T GAG CT G C T T C AG T 
T AC TAT CT CAT T T T T C AC TAT AT AT G T G T G TAT C AG C TAAT T C T AT GAT T TAAT T AAAC AAAT T GT AAG TAT T AAC 

35 AAAATAACGAATAAATATGGAAAATAAGTACTTGATGAACGTAGGGCCGGAGTTGGCCGAGGTGACCGGAGACAAT 
GGAAAGCAGAGTTACTATTTTTGACTAAATAGCCACAAAAGAATCATTGTTTTTACAATGTAGCAAGTTGGCACGA 
T TAT GAT T C T T G AC AC AAT AG C C AC AT TAT AG AAAG AT AAT G T G G C AC TAAT G AGGT AAT T T T CAT TAT G G AAT G A 
TAACAAACAAAT AAGT ACACGATTAAAACAAACTGAAAGGGTTTTGCGCGATAATTT AAT CATTGTTTTACGAAAA 
TACTCATCAAAATCAAAATATTTATCTGCCTCTGCATGTAAGTTTCATATTTACTCGTCTTGCCATAATTTATATG 

40 AAAAAAT T T AC T C ACAC CG GAT AT AT AT AC C T AA C CAT AG C AAT T AAC TAT G G T AAT GT G AT G TAAT GAAG AG AG A 
ATGTCTATTAATTAATTATGGCTAAGTGATAGTAGTGTATTGTAAACAAATGACGTGCATTTGTTGATTAGACACT 
TACAAAAATACCCACGAAATCTAAAATAATTACAGCCACTATCCACTACTTTCAAATATTATCTGGCCTACCCATT 
AAAATATTTACTCACTCTACCCCTCCAGACTTATATATTATAAGGTATAAAAAGGTAAACAATAATAAATGGTCCT 
CCAGACTTTTATACCATAATTTATGCAGCCTTAAAGGTATACACCTATAAACAAAGGTATACAATAAAAAATGGGT 
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ATGTTGGGTAAATACTTTTAGTTTTATGGGTAGAGTAATTTTTAATGGGTATGACTTGTAAATACTTTAAATTTCA 
TGGGTATAGAGTGTAAAAATTCCTTTGTTGATTGGGTATATACACCCGATGTGGGTAAGTACTTGCCTAATTTTTG 
CCCTAAGGTAAATATAGACTTATAGTATAAAAAAAAATACGCAGTGTGATAGATACTTTGAATTTCATGAGTAATG 
T AAT T T T T AAT AGGT AT AGT AAGG T AAAT AC T T TAT AT T C C AAG GG T AC AC AT T G T AAAAAG C T C AAT AT T T TAT T 
5 CC AAAG AAT AAG AG AC C AAAC AAT G T GT T T GAG ATT TAT T AC T TTGTTGTC C AC C AAAC T AAAAAG AAAAC T T TAG 
AAGTCTAAATTACAATAATCTTAACATGCATTTTACGAATAAATATCACAAAATCTCAAACTATTAGAGATAATGT 
CGT GG AT GAT G T T AAC AT AT T GG AC T AC AC AAC C CAT T G T AC AAT AATT T T G AAG CAT G TAT AT G C ACG AC C AAG A 
CTCCATCATCATAGATCAAATGAATGTTCATTTTAATGCATGAAACCTAAGTAGAACATTTATGCCTTAATGAACT 
AAAACCAAGCAAAAAGATACATCTACTTGTGCAATTGAATGAATTCTACCGTATATACTAATATACACCAGAGGTT 

10 AGT T T AAC AC T T GGAAC T T C AAAAG GT G T AC AAC CAT AG AG T T T C C T T T AC AT T GAT GGTTTCTTT CAT T T C ACT A 
ACTGATAAAATGAAGGCTGGTATAGTCTACCAAATCCCTAGTTCCCTGTGAACTTGCATCCCTTCTAGCTACATGC 
AG AAC AT GT C C T T TAG AT C C CAT AG G T GT AT T G C CAT T T G C C AC T G AAC AAT G G AG G AC AAT G TAT AAT T GT C CT C 
CTCACCCATTGCACATACTCTGTCATTTGCTGCACATCTACATGCCTTTTCTGAATATTCTTCTGAGTCAAATAAG 
CAT CAT GAG AC AT CT GT C AAG TAT C T T T G AAT G GG AT AAT C AC AT T T C C AAAT C G AAAG GT TCTTGTCT T AAC AAG 

15 TCAAGCTGCATCTCGACAAAGAGACTTCGTTGATGAAATGCGACCATAAAGAGCACATGCAAACCAGTTGTTAAAA 
G CAT T GT AC AC AT AT AC CT AC TT CT GAT GT G AAT AAAAG AAAGT C GAT C AAT G AC AG AGG AAAAC AGT C AAT CT AT 
AAC C AC AAAAAT AC T T T T C T T T AAAAGT ACG AC C AC AT AG AT AAC TAT AAT T T C C C G TAG AT G T C AAAC T C T TAT T 
GAATAAAAAATAAACAAACATGTACTATTTGCTCATTTATCCGACTGTCACAAGGATTTTCTTAATGATGGTATAA 
TAG GAG C AAT C C CT T T TAT G AC AG AT G C ACT AAT TTGTTTGGGTG CAT AT T T C AAT G C AG AAC TGT G G GG T AT AT A 

20 AT C T AAAAT AT CAT T C AAAT C AAAC C T G G G AAC GAT T GAG AG AAG AT TAG CAT GGCCTCTG C AC AAG GAT G AC AC G 
CATAAATCGAGAAATGTTCCAAATAAAGGAAATATATATATTACCTGTTTCAATTGGCATAGTTCTTAAAGAAGTT 
TTGGCAGTTAAAGTATTAATAGTTTACCTTGTTTCGATTGTGGGATTTAGCCTTGGGGTTGTCTGGGACGGACCTG 
TGATTATTCTGCTAATCTCCTTGTATATTCATGCAATGTGCAGTTTAATCCAGTGCATTTTGCGTGTTATGGATGG 
ATCC 

25 

[0047] Another suitable nucleic acid molecule in accordance with the present 

invention is isolated from rice and identified herein as Rhprl, which has a nucleotide 
sequence of SEQ ID NO: 20, as follows: 

30 ATGGCGCGCCGCGTCCCTACCCGCCCGCGCGGCGGTGGCGGCGGCGGCGTCCCACGCTCGGAGGGCTCGATCCAAG 
GGCGAGGAGGCCGCGCGGGGGGCAGTGGCGCCGAGGACGCACGCCACGTGTTCGACGAATTGCTCCGGCGTGGCAG 
GGGCGCCTCGATCTACGGCTTGAACCGCGCCCTCGCCGACGTCGCGCGTCACAGCCCCGCGGCCGCCGTGTCCCGC 
. TACAACCGCATGGCCCGAGCCGGCGCCGGCAAGGTAACTCCCACCGTGCACACCTATGCCATCCTCATCGGCTGCT 
GCTGCCGTGCGGGCCGCTTGGACCTCGGTTTCGCGGCCTTGGGCAATGTCGTCAAGAAGGGATTTAGAGTGGATGC 

35 CATCACCTTCACTCCTCTGCTCAAGGGCCTCTGTGCCGACAAGAGGACGAGCGACGCAATGGACATAGTGCTCCGC 
AGAATGACCGAGCTCGGCTGCATACCAGATGTCTTCTCCTACAATAATCTTCTCAAGGGTCTGTGTGATGAGAACA 
GAAGCCAAGAAGCTCTCGAGCTGCTGCACATGATGGCTGATGATCGAGGAGGAGGTAGCCCACCTGATGTGGTGTC 
GTATAACACTGTCCTCAATGGCTTCTTCAAAGAGGGGGATTCAGACAAAGCTTACAGTACATACCATGAAATGCTG 
GACCGGGGGATTTTACCAGATGTTGTGACCTACAGCTCTATTATTGCTGCGTTATGCAAGGCTCAAGCTATGGACA 

40 AAGCCATGGAGGTACTTAACACCATGGTTAAGAATGGTGTCATGCCTGATTGCATGACATATAATAGTATTCTGCA 
TGGATATTGCTCTTCAGGGCAGCCAAAAGAGGCTATTGGAACACTCAAAAAGATGCGCAGTGATGGCGTCGAACCA 
AAT GT T GT T AC T T AT AGT T C AC T GAT G AAT TAT C T T T GC AAG AAT G G AAG AT C C AC C G AAG C T AG AAAG AT T T T C G 
ATTCTATGACCAAGAGGGGCCTAGAGCCTGATATTGCTACCTATCGTACCCTGCTTCAGGGGTATGCTACCAAAGG 
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AGCCCTTGTTGAGATGCATGCTCTCTTGGATTTGATGGTACGAAATGGTATCCAACCGGATCATCATGTATTCAAC 
AT T C T AAT AT GT G C AT ACG C T AAAC AAG AG AAAGT AG AT C AG G C AAT G C T T GT AT T C AG C AAAAT G AG GC AG CAT G 
GATTGAATCCGAATGTAGTGTGCTATGGAACAGTTATAGATGTACTTTGCAAGTCAGGCAGTGTAGATGATGCTAT 
GCTTTATTTTGAGCAGATGATCGATGAAGGACTAACCCCTAACATTATTGTGTATACCTCCCTAATTCATGGTCTG 
5 TGCACCTGTGACAAATGGGACAAGGCTGAAGAGTTAATTCTTGAAATGTTGGATCGAGGCATCTGTCTGAACACTA 
TTTTCTTTAATTCAATAATTGACAGTCATTGCAAAGAAGGGAGGGTTATAGAATCTGAAAAACTCTTTGACTTGAT 
G GT AC G AAT T G G T G T G AAG C C CG AT AT C AT T AC G T AC AAT AC AC T CAT CG AT G GAT G CT G C T T AG CT G GT AAG AT G 
GATGAAGCAACGAAGTTACTTGCCAGCATGGTCTCAGTTGGGGTGAAACCTGATATTGTTACCTATGGCACCTTGA 
TTAATGGCTACTGTAGAGTTAGCAGGATGGATGACGCATTAGCTCTTTTCAAAGAGATGGTGAGCAGTGGTGTTAG 

10 T C CT AAT AT TAT T AC GT AT AAC AT AATT CT G C AAG GT T TAT T T CAT AC C AG AAG AACT G C T G C T G C AAAAG AACT C 
TAT G T C AGT AT T AC C AAAAGT G G AAC AC AG C T T G AAC T TAG C AC GT AC AAC AT AAT C C T T CAT G G ACT TT G C AAAA 
ACAATCTCACTGACGAGGCACTTCGAATGTTTCAGAACCTATGTTTGACGGATTTACAGCTGGAGACTAGGACTTT 
TAACATTATGATTGGTGCCTTACTTAAATGTGGAAGAATGGATGAAGCTAAGGATTTGTTTGCTGCTCACTCGGCT 
AACGGTTTAGTGCCAGATGTTAGGACCTACAGTTTAATGGCAGAAAATCTTATAGAGCAGGGGTCGCTAGAAGAAT 

15 TGGATGATCTATTTCTTTCAATGGAGGAGAATGGCTGTTCCGCCGACTCCCGCATGCTAAATTCCATTGTTAGGAA 
ACT GT T AC AG AG GG GT GAT AT AAC C AGG GC T GG C AC TTACCTGTT CAT GAT T GAT GAG AAG C AC T TCTCCCTC G AA 
GCATCCACTGCTTCCTTCTTGTTAGAATCTTCCCCAATCGTCTGGGAGCAAATATCAAGAATATCACACTTGTCTG 
TAAATTTGAAATTAATTAAGCAGCCCAAATGCACCTGTGAGTTAGGCCCAAAGTGGTCCCAAAATCTGCCTAAACC 
TGGCACAAATTCGGTCGGTAGTGTCGCACAGTTTCACTTATCGCGCGGCGGTTATCGCGCTTACCGCGGGGGTACG 

20 ACGGTTACCGCACTACCGCAGGGTGACGGTAACCCCGGCCCAAACGATAAGGTAAACCCTGGTCGCACAAATTTGG 
CCCAAAACCGACCAGTTATCGCGCTACCGCGGGATGCCTCAGTAGGACCTTAG 

Rhprl is a rice homolog of the Petunia R/-PPR592 gene. 

[0048] The nucleic acid molecule of the present invention which has the 

25 nucleotide sequence of SEQ ID NO: 20 encodes a protein or polypeptide having a 
deduced amino acid sequence corresponding to SEQ ID NO: 21 as follows: 

MARRVPTRPRGGGGGGVPRSEGSIQGRGGRAGGSGAEDARHVFDELLRRGRGASIYGLNRALADVARHSPAAAVSR 
YNRMARAGAGKVT PTVHTYAILIGCCCRAGRLDLGFAALGNWKKGFRVDAITFTPLLKGLCADKRTSDAMDIVLR 

30 RMTELGCIPDVFSYNNLLKGLCDENRSQEALELLHMMADDRGGGSPPDWSYNTVLNGFFKEGDSDKAYSTYHEML 
DRGILPDVVTYSSIIAALCKAQAMDKAMEVLNTMVKNGVMPDCMTYNSILHGYCSSGQPKEAIGTLKKMRSDGVEP 
NWTYSSLMNYLCKNGRSTEARKIFDSMTKRGLEPDIATYRTLLQGYATKGALVEMHALLDLMVRNGIQPDHHVFN 
ILICAYAKQEKVDQAMLVFSKMRQHGLNPNVVCYGTVIDVLCKSGSVDDAMLYFEQMIDEGLTPNIIVYTSLIHGL 
CTCDKWDKAEELILEMLDRGICLNTIFFNSIIDSHCKEGRVIESEKLFDLMVRIGVKPDIITYNTLIDGCCLAGKM 

35 DEATKLIASMVSVGVKPDIVTYGTLINGYCRVSRMDDALALFKEMVSSGVSPNIITYNIILQGLFHTRRTAAAKEL 
YVSITKSGTQLELSTYNIILHGLCKNNLTDEALRMFQNLCLTDLQLETRTFNIMIGALLKCGRMDEAKDLFAAHSA 
NGLVPDVRTYSLMAENLIEQGSLEELDDLFLSMEENGCSADSRMLNSIVRKLLQRGDITRAGTYLFMIDEKHFSLE 
ASTASFLLESSPIVWEQISRISHLSVNLKLIKQPKCTCELGPKWSQNLPKPGTNSVGSVAQFHLSRGGYRAYRGGT 
TVTALPQGDGNPGPNDKVNPGRTNLAQNRPVIALPRDASVGP 



40 
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[0049] Another suitable nucleic acid molecule in accordance with the present 

invention is isolated from rice and identified herein as Rhpr2, which has a nucleotide 
sequence of SEQ ID NO: 22, as follows: 

5 ATGGCGCGCCGCGCCGCTTCCCGCGTCCGCGCCGGCGCTGTTGGCGCCCTTCGCTCGGAGGGCTCGACCCAAGGGC 
GAGGGGGCCGCACGGGGGGCAGTGGCGCCGAGGACGCACGCCACGTGTTCGACGAATTGCTCCGGCGTGGCAGGGG 
CGCCTCGATCTACGGCTTGAACTGCGCCCTCGCCGACGTCGCGCGTCACAGCCCCGCGGCCGCCGTGTCCCGCTAC 
AACCGCATGGCCCGAGCCGGCGCCGACGAGGTAACTCCCAACTTGTGCACCTACGGCATTCTCATCGGTTCCTGCT 
GCTGCGCGGGCCGCTTGGACCTCGGTTTCGCGGCCTTGGGCAATGTCATTAAGAAGGGATTTAGAGTGGACGCCAT 

10 CGCCTTCACTCCTCTGCTCAAGGGCCTCTGTGCTGACAAGAGGACGAGCGACGCAATGGACATAGTGCTCCGCAGA 
ATGACCCAGCTTGGCTGCATACCAAATGTCTTCTCCTACAATATTCTTCTCAAGGGGCTGTGTGATGAGAACAGAA 
GCCAAGAAGCTCTCGAGCTGCTCCAAATGATGCCTGATGATGGAGGTGACTGCCCACCTGATGTGGTGTCGTATAC 
CACTGTCATCAATGGCTTCTTCAAGGAGGGGGATCTGGACAAAGCTTACGGTACATACCATGAAATGCTGGACCGG 
G GG AT T T T AC C AAAT GT T GT TAG C T AC AG C T C T AT TAT T G C T G CG T T AT G C AAGG C T C AAG CT AT GG AC AAAGC C A 

15 TGGAGGTACTTACCAGCATGGTTAAGAATGGTGTCATGCCTAATTGCAGGACGTATAATAGTATCGTGCATGGGTA 
TTGCTCTTCAGGGCAGCCGAAAGAGGCTATTGGATTTCTCAAAAAGATGCACAGTGATGGTGTCGAACCAGATGTT 
GTTACTTATAACTCGCTCATGGATTATCTTTGCAAGAACGGAAGATGCACGGAAGCTAGAAAGATGTTCGATTCTA 
T G AC C AAG AG G G G C CT AAAG C CT G AAAT TACT AC C T AT GGT AC C C T G CT T C AGGG G T AT GC T AC C AAAG G AG CC C T 
TGTTGAGATGCATGGTCTCTTGGATTTGATGGTACGAAACGGTATCCACCCTAATCATTATGTTTTCAGCATTCTA 

20 AT AT GT G CAT AC G C T AAAC AAG G G AAAG TAG AT C AGG C AAT GCTTGTGTT C AG C AAAAT GAG G C AG C AAG GAT T G A 
ATCCGGAT AC AGT GACCT AT GG AAC AGT T AT AGGC AT ACTTTG C AAGT CAGGCAGAGT AGAAGAT GCT ATGCGTT A 
TTTTGAGCAGATGATCGATGAAAGACTAAGCCCTGGCAACATTGTTTATAACTCCCTAATTCATAGTCTCTGTATC 
TTTGACAAATGGGACAAGGCTAAAGAGTTAATTCTTGAAATGTTGGATCGAGGCATCTGTCTGGACACTATTTTCT 
TTAATTCAATAATTGACAGTCATTGCAAAGAAGGGAGGGTTATAGAATCTGAAAAACTCTTTGACCTGATGGTACG 

25 TATTGGTGTGAAGCCCGATATCATTACGTACAGTACTCTCATCGATGGATATTGCTTGGCAGGTAAGATGGATGAA 
GCAACGAAGTTACTTGCCAGCATGGTCTCAGTTGGAATGAAACCTGATTGTGTTACATATAATACTTTGATTAATG 
GCTACTGTA7\AATTAGCAGGATGGAAGATGCGTTAGTTCTTTTTAGGGAGATGGAGAGCAGTGGTGTTAGTCCTGA 
TATTATTACGTATAATATAATTCTGCAAGGTTTATTTCAAACCAGAAGAACTGCTGCTGCAAAAGAACTCTATGTC 
G GG AT T AC C G AAAG TGG AACG C AG C T T G AACT TAG C AC AT AC AAC AT AAT C CT T CAT GGGCTTTG C AAAAAC AAT C 

30 TCACTGACGAGGCACTTCGAATGTTTCAGAACCTATGTTTGACGGATTTACAGCTGGAGACTAGGACTTTTAACAT 
TATGATTGGTGCATTGCTTAAAGTTGGCAGAAATGATGAAGCCAAGGATTTGTTTGCAGCTCTCTCGGCTAACGGT 
T TAG T G C C AG AT GT T AGG AC C T AC AG T T T AAT G G C AG AAAAT C TT AT AG AG C AG G G G T T G C TAG AAG AAT T G GAT G 
ATCTATTTCTTTCAATGGAGGAG7VATGGCTGTACTGCCAACTCCCGCATGCTAAATTCCATTGTTAGGAAACTGTT 
AC AG AG GG G T GAT AT AAC C AG G G C T G G C AC TTACCTGTT CAT GAT T G AT GAG AAG C AC T T C T C C C T CG AAG CAT C C 

35 ACTGCTTCCTTGTTTTTAGATCTTTTGTCTGGGGGAAAATATCAAGAATATCATAGTTGTATTAGAGGAGGGATCT 
TCTCTTTATGTGTAAATAGCGAGGTTCAAGAAAATCATTTGTTGGATTCAGAATCTGGTGTCCATTTTCTTCTTAA 
ATTATTAAATCCTCCAGTGAATCTTGTTGATTCCAAAGCACCATCGATAGGTTCCAAACTTCTTGGAATCAGTAAA 
GTTCAAATGCTTAATGGATCAAATAAGGATTCTGACTGCATTTCAGAGGAAATCCTTTCAAAAGTTGAAGAGATTC 
TCTTAAGCTGTCAAGTGATCAAGTCGCTCGACAAAGATGACAAGAAAACAACAAGGCCAGAACTGTGTCCAAAGTG 

40 GCTTGCTTTGTT G AC AAT GG AAAAT G CAT GCTTGTCTGCTGT T TC AG TAG AGG AG AC T T CT G AC AC AGT GT C C AG A 
GTTGGAGGAAATTTTAAAGAGACATTAAGGGAGATGGGAGGTCTTGATAGTATTTTTGACGTTATGGTGGATTTTC 
ATTCAACATTGGAGAATCTCATAAAGGATACATCCACTTCAGCTTTGGACCGAAATGAAGGAACATCTTTGCAAAG 
TGCTGCTCTCCTCTTGAAATGTTTGAAAATATTGGAAAATGCCATATTTCTAAGCGATGATAACAAGACCCATTTG 
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CTTAATATGAGTAGAAAATTGAACCCGAAACGCTCCTTGCTTTCTTTTGTTGGTGTCATTATCAATACTATTGAGT 
TATTATCAGCTCTTTCAATACTTCAGAATTCTTCTGTTGTTTCCAGCTCTACATATCCGAAATCGTCTAAAGTCTC 
TCAACAGAGTTACTCTGTGGTGATGGCGGGGGGCGACCGTGGCCGAGGCGTGGAGTGCCATCCGCATCAGGGTGTA 
TCGGCCGCGCTGCTCCGCCCTGGTCCGCAGGCTTTGGCGGCGAGCTGGCGGCGGAGGGAGACTGTGGTGAGATCGG 
5 ATTTCGCCGCTGGTGGTGTCGCTACCATGGGGGATTCGCCGCAGGCGCTCTCAGATCGGTTATGCGGGAGCGCAAC 
AAAAGTATGGCGTGGCGGCGCGGAGTGGACGGCCGAGGCGTTCGCGCGGAATGGGGCTGCGGGACCGAGCCAGTCT 
CGCTTGCCGGTAACGCGGAACCGAGCTCAGCACTACATTGCAAAGATTTGGGCAACTCTGACAATTTCCATGTTCT 
AC AAGC TT G AC G T C G AG G G AAT G G AG AAC CT G C C AC C G AAT AGT AG C C C T G C T AT C T AT GT T G C G AAC CAT C AG AG 
TTTTTTGGATATCTATACCCTTCTAACTCTAGGAAGGTGTTTCAAGTTTATAAGCAAGACAAGTATATTTATGTTC 
10 C G AAT TAT T T G A 

Rhprl is a rice homolog of the Petunia R/-PPR592 gene. 

[0050] The nucleic acid molecule of the present invention which has the 

nucleotide sequence of SEQ ID NO: 22 encodes a protein or polypeptide having a 
15 deduced amino acid sequence corresponding to SEQ ID NO: 23 as follows: 

MARRAASRVRAGAVGALRSEGSTQGRGGRTGGSGAEDARHVFDELLRRGRGASIYGLNCALADVARHSPAAAVSRY 
NRMARAGADEVT PNLCTYGILIGSCCCAGRLDLGFAALGNVIKKGFRVDAIAFTPLLKGLCADKRTSDAMDIVLRR 
MTQLGCIPNVFSYNILLKGLCDENRSQEALELLQMMPDDGGDCPPDVVSYTTVINGFFKEGDLDKAYGTYHEMLDR 

20 GILPNWTYSSIIAALCKAQAMDKAMEVLTSMVKNGW 

VTYNSLMDYLCKNGRCTEARKMFDSMTKRGLKPEITTYGTLLQGYATKGALVEMHGLLDLMVRNGIHPNHYVFSIL 
ICAYAKQGKVDQAMLVFSKMRQQGLNPDTVTYGTVIGILCKSGRVEDAMRYFEQMIDERLSPGNIVYNSLIHSLCI 
FDKWDKAKELILEMLDRGICLDTIFFNSIIDSHCKEGRVIESEKLFDLMVRIGVKPDIITYSTLIDGYCLAGKMDE 
ATKLLASMVSVGMKPDCVTYNTLINGYCKISRMEDALVLFREMESSGVSPDIITYNIILQGLFQTRRTAAAKELYV 

25 GITESGTQLELSTYNIILHGLCKNNLT DEALRMFQNLCLTDLQLETRTFNIMIGALLKVGRNDEAKDLFAALSANG 
LVPDVRTYSLMAENLIEQGLLEELDDLFLSMEENGCTANSRMLNSIVRKLLQRGDITRAGTYLFMIDEKHFSLEAS 
TASLFLDLLSGGKYQEYHSCIRGGIFSLCVNSEVQENHLLDSESGVHFLLKLLNPPVNLVDSKAPSIGSKLLGISK 
VQMLNGSNKDSDCISEEILSBCVEEILLSCQVIKSLDKDDKKTTRPELCPKWLALLTMENACLSAVSVEETSDTVSR 
VGGNFKETLREMGGLDSIFDVMVDFHSTLENLIKDTSTSALDRNEGTSLQSAALLLKCLKILENAIFLSDDNKTHL 

30 LNMSRKLNPKRSLLSFVGVIINTIELLSALSILQNSSWSSSTYPKSSKVSQQSYSWMAGGDRGRGVECHPHQGV 
SAALLRPGPQALAASWRRRETWRSDFAAGGVATMGDSPQALS DRLCGSATKVWRGGAEWTAEAFARNGAAGPSQS 
RLPVTRNRAQHYIAKIWATLTISMFYKLDVEGMENLPPNSSPAIYVANHQSFLDIYTLLTLGRCFKFISKTSIFMF 
RII 

35 [0051] Another suitable nucleic acid molecule in accordance with the present 

invention is isolated from rice and identified herein as Rhpr3, which has a nucleotide 
sequence of SEQ ID NO: 24, as follows: 

ATGGCGCGCCGCGCCGCTTCCCGCGCTGTTGGCGCCCTTCGCTCGGACGGCTCGATCCAAGGGCGAGGAGGCCGCG 
40 CGGGGGGCAGTGGCGCCGAGGACGCACGCCACGTGTTCGACGAATTGCTCCGGCGTGGCAGGGGCGCCTCGATCTA 
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CGGCTTGAACCGCGCCCTCGCCGACGTCGCGCGTCACAGCCCCGCGGCCGCCGTGTCCCGCTACAACCGCATGGCC 
CGAGCTGGCGCCGACGAGGTAACTCCCGACTTGTGCACCTACGGCATTCTCATCGGTTGCTGCTGCCGCGCGGGCC 
GCTTGGACCTCGGTTTCGCGGCCTTGGGCAATGTCATTAAGAAGGGATTTAGAGTGGAAGCCATCACCTTCACTCC 
TCTGCTCAAGGGCCTCTGTGCCGACAAGAGGACGAGCGACGCAATGGACATAGTGCTCCGCAGAATGACCGAGCTC 
5 GG T T G CAT AC C AAAT GTCTTCTC C T AC AAT AAT C T T C T C AAC G GG C T GT GT G ATG AG AAC AG AAG C C AAG AAG C T C 
TCGAGTTGCTGCACATGATGGCTGATGATCGAGGAGGAGGTAGCCCACCTGATGTGGTGTCGTATACCACTGTCAT 
C AAT GGCTTCTT C AAAGAG GGGG AT T C AG AC AAAG C T T AC AG T AC AT AC CAT G AAAT G C T GG AC C G GG G G ATT T T A 
C C T G AT GT T G T G AC C T AC AG C T C TAT TAT T GCT G C G T T AT G C AAG G G T C AAG C TAT GG AC AAG C C ATG G AGT C AT T 
GCAAAGAAGGGAGGGTTATAGAATCTGAAAAACTCTTTGACCTGATGGTACGTATTGGTGTGAAGCCTGATATCAT 

10 T AC AT AC AG T AC AC T CAT CG AT G GAT AT T G CT T G G C AG G T AAG AT GG AT G AAG C AAT GAAG T TACT T T C T G G CAT G 
GT C T C AGT T GGG T T G AAAC CT AAT ACT GT T AC T TAT AG C AC T T TG AT T AAT GGCTACTG C AAAAT TAG T AGG AT GG 
AAGACGCGTTAGTTCTTTTTAAGGAGATGGAGAGCAGTGGTGTTAGTCCTGATATTATTACGTATAACATAATTCT 
G C AAG G T T T ATT T C AAAC C AG AAG AAC TGCTGCTG C AAAAG AACT C T AT GT C AG GAT T AC C G AAAGT G G AAC G C AG 
ATT G AACT T AGCACAT AC AAC AT AATCCT T CATG GACTTTG CAAAAACAAACT CACT GATGAT GC ACT T CAGAT GT 

15 T T C AG AAC CT AT GT T TGAT GG AT T T G AAG CT T G AGGCT AG G ACT T T C AAC AT TAT GAT T G ATG CAT TG C T T AAAG T 
TGGCAGAAATGATGAAGCCAAGGATTTGTTTGTTGCTTTCTCGTCTAACGGTTTAGTGCCGAATTATTGGACGTAC 
AG GT T GAT G G CT G AAAAT AT TAT AG G AC AG GGGTTGCT AG AAG AAT T G GAT C AAC TCTTTCTTT C AAT GG AG G AC A 
ATGGCTGTACTGTTGACTCTGGCATGCTAAATTTCATTGTTAGGGAACTGTTGCAGAGAGGAGTAGTGGTGGTGGT 
GAGTGGTGAATCTGCCACCACCCCACCACCAACTCTCAAAATTCTGACATGTGGGATCACTGTCAATCCCTTCTCC 

20 AAG AC AT G T G GG AT C AC T GT C AAT CCCTTCTC C AAAC C AAT T GT G C AG AC AG GT G C T T GC GGT C AG GT T AAAG AAG 
TTGGCAAAAATGCTTCTGAAGAAAGGTTAATTGTTGTTTCATCTCAGGAGATTCCAGATGATCCAGTGTCTCCAAC 
AATTGAGGCGCTTATTTTGCTCCATAGTAAAGCAAGTACACTTGCTGAGAACCACCAGTTGACAACACGGCTTGTT 
G T AC CAT C AAAC AAAGT T GGTT G T AT T C T T G G G GAAG G T GG AAAG G T AAT TACT G AAAT G AGAAG AC GG AC T G G GG 
CTGAAATCCGAGTCTACTCAAAAGCAGATAAACCTAAGTACCTGTCTTTTGATGAGGAGCTTGTGCAGCATATCAG 

25 CCTTATCTTGGTTGATCGGCATGCTGGACGAGCACATCTGTTGTCGCATCAACTGCTGACTGCTATATATGTGCTG 
GTGCTGAATCGATCGATTGTCGTCGCGGAAGTGAAGAACAACCACGGCACTGCTGCCTGCTGGGCTCTAGCCGCCA 
TCAGTTATAACCGTACAAACTTCAGTGATTTGCTGGTTTCACATTGGTTTATAATAAAGGCCTCCGTTTTTAGTTT 
CACGCTGGGCCTTCAGAATCTCAGGACCGGCCCTGCTCATGATCCTTACACCGTGTATCCTGTAGAGTACTTCTCT 
AAAAG AG AGT AC C C T AG T G GAAG TAG C AAAG T T GC AC CAT C T G C T T CAT AC G AAAG AT AT G CAG C AAC T AC T C GC T 

30 TGCCTAATGGAGAACTGCCCTCATCTATTAGTCCTGGTGCCGATTATATGTCCTGCCGTTCTTATCTTGACCAAGT 
AC C T ACT G AT AGGT AC T C T AAT AG GGT T AC AC T AC AAT TAG G C C T C T C GAG AG C C G GG AAT AG T AAT G T G C AAC AA 
T TAG G AAT C AC CAG AG C T G G AAAT T C C AAT GCT TAT G ATT AT AC T G AGG C T G C T G AGC AG AT C CAT G G AC G T GAG G 
ATTACCGAAGACTGTCAGGTCTCACTGGGTATCCAGGTGGCTCTTCGAATTGTGGATTCCAAATAGTTAACTGGAG 
TCTGTCATTGGTGTTGGTGATCTCTGGTGCGAGAGTGAAGTTGCACGAAGCCCATCCTGGTTCTTCCGAGTCCATT 

35 GTGGAGATCCAGGGCATTCCGGATCAAGTGAAAGCCGCACAGAGCCTTCTGCAAGGCTTCATCGGCGCAAGCAGCA 
ACAGCAGGCAGGCGCCCCAGTCCTCTCGCATGGCCCATTATTTTTAG 

Rhpr3 is a rice homolog of the Petunia R/-PPR592 gene. 

[0052] The nucleic acid molecule of the present invention which has the 

40 nucleotide sequence of SEQ ID NO: 24 encodes a protein or polypeptide having a 
deduced amino acid sequence corresponding to SEQ ID NO: 25 as follows: 
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l^lARRAASRAVGALRSDGSIQGRGGRAGGSGAEDARHVFDELLRRGRGASIYGLNRALADVARHSPAAAVSRYNRiy^ 
RAGADEVTPDLCTYGILIGCCCRAGRLDLGFAALGNVIKKGFRVEAITFTPLLKGLCADKRTSDAMDIVLRRMTEL 
GCTPNVFSYNNLLNGLCDENRSQEALELLHMMADDRGGGSPPDWSYTTVINGFFKEGDSDKAYSTYHEMLDRGIL 
PDWTYSSIIAALCKGQAMDKPWSHCKEGRVIESEKLFDLMVRIGVKPDIITYSTLIDGYCLAGKMDEAMKLLSGM 
5 VSVGLKPNTVTYSTLINGYCKISRME DALVLFKEMESSGVSPDI ITYNIILQGLFQTRRTAAAKELYVRITESGTQ 
IELSTYNIILHGLCKNKLTDDALQMFQNLCLMDLKLEARTFNIMIDALLKVGRNDEAKDLFVAFSSNGLVPNYWTY 
RLMAENIIGQGLLEELDQLFLSMEDNGCTVDSGMLNFIVRELLQRGWVWSGESATTPPPTLKILTCGITVNPFS 
KTCGITVNPFSKPIVQTGACGQVKEVGKNASEERLIWSSQEIPDDPVSPTIEALILLHSKASTLAENHQLTTRLV 
VPSNKVGCILGEGGKVITEMRRRTGAEIRVYSKADKPKYLSFDEELVQHISLILVDRHAGRAHLLSHQLLTAIYVL 
10 VLNRSIVVAEVKNNHGTAACWALAAISYNRTNFSDLLVSHWFIIKASVFSFTLGLQNLRTGPAHDPYTVYPVEYFS 
KREYPSGSSKVAPSASYERYAATTRLPNGELPSSISPGADYMSCRSYLDQVPTDRYSNRVTLQLGLSRAGNSNVQQ 
LGITRAGNSNAYDYTEAAEQIHGREDYRRLSGLTGYPGGSSNCGFQIVNWSLSLVLVISGARVKLHEAHPGSSESI 
VE I QG I PDQVKAAQS LLQGF I GAS SN S RQAPQS S RMAH Y F 

15 [0053] Another suitable nucleic acid molecule in accordance with the present 

invention is isolated from rice and identified herein as Rhpr4, which has a nucleotide 
sequence of SEQ ID NO: 26, as follows: 

ATGCCGCTCGCCACGCTGCTCGGCCACCTCGCCGCCGGCCGCTTCGGCCTCGTGCAGGCGCTCACCGGCGCCGCGA 

20 CCGCGGCGGCCGCGCACCGACTCCTCCACCTCCTCCTCCGCACAGCGCCGCCGCCTCCCCTCCCGGACCTCGTCTC 
CCTCGCGCGGTGGTCGCGCGCCCACTTCCGCGCGCCGCTCCCGCTCCGGCTCCACGGGCTCCTCCTCGCCCGCCTC 
GCCTCCAAGGGGCTCTACCCCCTCCTCCGCTCCGAGCTCCACGTCCTCGCCGCGGCGCGCCTCCACTCCCCCGCAT 
CCATCCTCCGCGCTCTCCCCTCCCCGTCCGCGTCCGCGTCCGCATCCACGCCGCTCATCGCCGACATGCTCGTCCT 
CGCCCTCGCCAGGGCATCCCAGCCCCTCAGGGCGTACGACGCGTTCCTCCTCGCCGGGGAGAGCCACCCGCGGCAC 

25 CGCCCCTCCACCTCCTCCGTGAACGCCCTTCTCGCCGGCCTCGTCGGCGCCAAGCGGGTCGACCTCGCCGAGAAGG 
CGTTCAGGAGCGCGCTGCGGCGGCGCGTGTCACCGGACATCTACACCTTCAACACCGTCATCTCCGGCCTCTGCAG 
GATCGGCCAGCTCCGCAAAGCCGGCGATGTCGCCAAGGACATCAAGGCATGGGGTCTGGCTCCCTCTGTGGCCACC 
T AC AAT AG CCT CAT C GAT GG GT AC T G C AAG AAG GG T G GAG C T G GG AAC AT GT AC CAT G T C G ACAT G C T T T T G AAG G 
AG AT GG T C G AAGC C G GG AT C T C AC C G AC T G C AG T T AC AT TTGGTGTGTT GAT C AAT G GG T AT T G C AAG AAC T C G AA 

30 TACTGCGGCCGCAGTGAGAGTCTTCGAGGAGATGAAGCAGCAGGGGATCGCTGCGAGTGTCGTGACGTATAATTCG 
C T AAT T T C AGGT C T CT G C AG T G AGG G T AAGG TG G AG G AAG GG G T G AAG C TG AT G G AGG AG AT GGAG GAT T T G GG G C 
T GT C AC C C AAT G AAAT CACCTTTGGCTGTGTTCT G AAAG G GT T T T G T AAGAAG G G AAT GAT G GC AG ATG C C AAT G A 
TTGGATTGATGGTATGACAGAGAGGAATGTGGAACCTGATGTGGTTATTTACAATATCTTGATCGATGTGTATCGC 
CGT CT T G G AAAAAT GG AGG AT G C AAT G G C G G TG AAG GAG G C AAT GG C AAAG AAG G G GAT C AGT C C C AAT GT C AC AA 

35 CATATAATTGCTTGATAACAGGGTTTAGCCGCAGTGGGGATTGGAGGAGTGCTTCTGGCCTTCTGGATGAGATGAA 
GGAGAAAGGTATTGAAGCAGACGTCGTCACTTACAATGTGCTTATTGGTGCTTTGTGCTGCAAAGGTGAGGTACGG 
AAAG C T GT AAAG CT C T T G GAT G AAAT G T C G G AAGT T G G AT T GG AAC C AAAC CAT CT G AC C T AC AAT ACC AT AAT AC 
AGGGGTTCTGTGATAAGGGTAACATTAAGTCTGCCTATGAAATTAGAACCAGGATGGAAAAATGTCGGAAACGGGC 
AAATGTGGTTACGTACAATGTGTTCATCAAGTATTTCTGCCAGATAGGGAAGATGGATGAAGCTAATGATCTACTC 

40 AAT GAG AT G T T GG AC AAAT GT C TAG T T C C AAAC G G GAT C AC T T AT G AAACG AT AAAAG AGG G GAT G ATG G AAAAAG 
G C T AT AC AC C AG AT AT TAG AG GGTGCACTGT CT C AC AAG C T T C T G AAAAC C C AG CAT CAT C C TG A 
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Rhpr4 is a rice homolog of the Petunia R/-PPR592 gene. 

[0054] The nucleic acid molecule of the present invention which has the 

nucleotide sequence of SEQ ID NO: 26 encodes a protein or polypeptide having a 
deduced amino acid sequence corresponding to SEQ ID NO: 27 as follows: 

5 

MPIATLLGHLAAGRFGLVQALTGAATAAAAHRLLHLLLRTAPPPPLPDL^ 

ASKGLYPLLRSELHVLAAARLHSPASILRALPSPSASASASTPLIADMLVLALARASQPLRAYDAFLLAGESHPRH 
RPSTSSVNALLAGLVGAKRVDLAEKAFRSALRRRVSPDIYTFNTVISGLCRIGQLRKAGDVAKDIKAWGLAPSVAT 
YNSLIDGYCKKGGAGNMYHVDMLLKEMVEAGISPTAVTFGVLINGYCEOSfSNTAAAVRVFEEMKQQGIAASWTYNS 
10 LISGLCSEGKVEEGVKLMEEMEDLGLSPNEITFGCVLKGFCKKGMMADANDWIDGMTERNVEPDVVIYNILIDVYR 
RLGKMEDAMAVKEAMAKKGISPNVTTYNCLITGFSRSGDWRSASGLLDEMKEKGIEADWTYNVLIGALCCKGEVR 
KAVKLLDEMSEVGLEPNHLTYNTIIQGFCDKGNIKSAYEIRTRMEKCRKRANWTYNVFIKYFCQIGKMDEANDLL 
NEMLDKCLVPNGITYETIKEGMMEKGYTPDIRGCTVSQASENPASS 

15 [0055] Another suitable nucleic acid molecule in accordance with the present 

invention is isolated from rice and identified herein as Rhpr5, which has a nucleotide 
sequence of SEQ ID NO: 28, as follows: 

ATGGCTGATGATGGTCGCTGCCCACCTGATGTGGTGTCGTATAATACCATCATTGATGGTCTCTTCAAAGAGGGTG 

20 ATGTGGACAAAGCTTACATCACATACCATGAAATGCTGGACCGGAGGGTTTCTCCAGATGCTGTGACTTACAACTC 
TATCATTGCTGCCTTAAGCAAGGCTCAAGCTATGGACAGGGCCATGGAGGTACTTACAGTGATGGTTATGCCCAAT 
TGCTTCACATATAATAGTATTATGCATGGATATTGTTCTTCAGGACAGTCGGAAAAGGCTATTGGTATTTTCAGAA 
AGATGTGCAGTGATGGTATTGAACCAGATGTTGTTACTTATAACTCGTTGATGGACTATCTCTGCAAGAACGGAAA 
ATGCACAGAAGCCAGAAAGATTTTTGATTCTATGGTCAAGAGGGGTCTCAAGCCTGATATTACTACCTATGGTACC 

25 CTGCTTCATGGGTATGCTTCCAAAGGAGCTCTTGTTGAGATGCATGATCTCTTAGCTTTGATGGTACAAAATGGCA 
TGCAACTTGATCATCATGTCTTCAACATATTAATATGTGCATACACTAAACAAGAAAAAGTAGACGAGGTCGTGCT 
TGTATTCAGCAAAATGAGGCAGCAAGGATTGACTCCGAACGCAGTGAACTATAGAACAGTGATAGATGGACTTTGC 
AAG T T AG G TAG AC TAG AT GAT G C TAT G C T T AAT T T T GAG C AG AT GAT T G AT AAAG G AC T G AC ACCT AACG T T G TT G 
TTTATACCTCCCTAATTCATGCTCTCTGTACCTATGACAAATGGGAGAAGGCCGAGGAGTTAATTTTTGAAATATT 

30 G GAT C AAGG T AT C AAT C C C AAC AT TGTGTTTTT T AAT AC AAT AT T GG AC AG T C T T T G C AAAG AAGG G AGG G T TATA 
GAATCTAAAAAACTCTTTGACCTGTTGGGACATATTGGTGTGAATCCTGATGTCATTACATACAGTACACTCATCG 
ATGGATATTGCTTAGCTGGTAAGATGGATGGAGCAATGAAGTTACTCACTGGCATGGTCTCAGTTGGGTTGAAACC 
TGATAGTGTTACATATAGCACTTTGATTAATGGTTACTGTAAAATTAATAGAATGGAGGACGCATTAGCTCTTTTC 
AAGGAGATGGAAAGCAATGGTGTTAATCCTGATATTATTACATATAACATAATTCTGCATGGTTTATTTCGCACCA 

35 GAAGAACTGCTGCTGCAAAAGAACTATATGCCAGGATTACCGAAAGTGGAACGCAGCTTGAACTTAGCACATACAA 
CATAATCCTCATGGACTTTGCAAAAACAAACTCACTGATGATGCACTTCGGATGTTTCAGAACCTATGTTTGA 

Rhpr5 is a rice homolog of the Petunia R/-PPR592 gene. 
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[0056] The nucleic acid molecule of the present invention which has the 

nucleotide sequence of SEQ ID NO: 28 encodes a protein or polypeptide having a 
deduced amino acid sequence corresponding to SEQ ID NO: 29 as follows: 

5 MADDGRCPPDVVSYNTIIDGLFKEGDVDKAYITYHEMLDRRVSPDAVTYNSIIAALSKAQAMDRAMEVLTVKV-iyiPN 
CFTYNSIMHGYCSSGQSEKAIGIFRKMCSDGIEPDWTYNSLMDYLCKNGKCTEARKIFDSMVKRGLKPDITTYGT 
LLHGYASKGALVEMHDLLALMVQNGMQLDHHVFNILICAYTKQEKVDEWLVFSKMRQQGLTPNAVNYRTVIDGLC 
KLGRLDDAMLNFEQMIDKGLTPNVWYTSLIHALCTYDKWEKAEELIFEILDQGINPNIVFFNTILDSLCKEGRVI 
ESKKLFDLLGHIGVNPDVITYSTLIDGYCLAGKMDGAMKLLTGMVSVGLKPDSVTYSTLINGYCKINRMEDALALF 
10 KEMESNGVNPDIITYNIILHGLFRTRRTAAAKELYARITESGTQLELSTYNIILMDFAKTNSLMMHFGCFRTYV 

[0057] Another suitable nucleic acid molecule in accordance with the present 

invention is isolated from rice and identified herein as Rhpr6, which has a nucleotide 
sequence of SEQ ID NO: 30, as follows: 

15 

ATGGCGCGCCGCGCCGCTTCCCGCGCTGTTGGCTCGGAGGGCTCGATCCAAGGGCGAGGGGGCCGCGCGGGGGGCA 
ATGGCGCCGAGGACGCACGCCACGTGTTCGACGAATTGCTTCGGCGTGGCAAGGGCGCCACGATCTACGGCTTGAA 
CCGCGCCCTCGACGACGTCGCGCGTCACAGCCCCGCGGCCGCCGTGTCCCGCTACAACCGCATGGCCCGAGCCGGC 
GCCGACGAGGTAACTCCCAACTTGTACACCTACAGCGTTCTCATCGGTTGCTGCTGCCGGGCGGGCCGCTTGGACC 

20 TCGGTTTCGCGGCCTT GGG C AAT GT C AT T AAG AAGG G AT T TAG AG T G G AAG C CAT C AC CTTCACTCCTCTG CT C AA 
GGGCCTCTGTGCC G AC AAG AGG AC GAG C G AC G C AAT GG AC AT AGT G C T C T GC AG AAT G AC C C AG CTCGGCTG CAT A 
CCAAATGTCTTCTCCTGCACCATTCTTCTCAAGGGTCTGTGTGATGAGAACAGAAGCCAAGAAGCTCTCGAGCTGC 
T C C AAAT GAT G C C T G ATG AT GG AGGT G AC T G C C C AC C T G AT GTGGTGTTG T AC AAC AC C G T CATC AAT GGCTTCTT 
CAAAGAGGGGGATCCGGACAAAGCTTACGCTACATACCATGAAATGTTTGACCAGGGGATTTTGCCAGATGTTGTG 

25 ACTTACAGCTCTATTATCGCTGCCTTATGCAAGGCTCAAGCTATGGACAAGGCCATGGAGGTACTTAACACCATGG 
T T AAG AAT G G T GT CAT GC C T AAT T G C AG G AC AT AT AAT AGT ATT G T G C AC G GAT AT T G C T C T T C AG G G C AGT T G AC 
AGAGGCTATTGGATTTCTCAAAATGATGTGCAGTGATGGTGTCGAACCAGATGTTGTTACTTGTAACTTGCTGATG 
GATTATCTTTGCAAGAACAGAAGATGCACGGAAGCTAGAAAGATTTTCAATTCTATGACCAAGTGTGGCCTAAAGC 
CTGATATTACTACCTATTGTACCCTGCTTCAGGGGTATGCTACCAAAGGAGCCCTTGTTGAGATGCATGATCTCCT 

30 GGATTTGATGGTATGGAACGGTATCCAACCTAATCATCATGTATTCAACATTCTAATATGTGCATACGCTAAACAA 
GAAAAAGTAGATGAGGCGATGCTTGTATTCAGCAAAATGAGGCAGCAAGGATTGAGTCCGAATGCAGTGAACTACA 
G AAC AG T CAT AG AT G T AC T C T G C AAG CT AGG C AG AG TAT AC GAT G C AG T G C T T AC CT T AAAG C AG AT GAT C AAT G A 
AG G AC T AAC C C C T G AC AT CAT T GT AT AT AC C C C C C T AAT T C ATG GT TTTTGTACCTGT G AC AAAT G G GAG AAG G C T 
GAGGAGTTAATTTTTTAA 

35 

Rhpr6 is a rice homolog of the Petunia R/-PPR592 gene. 

[0058] The nucleic acid molecule of the present invention which has the 

nucleotide sequence of SEQ ID NO: 30 encodes a protein or polypeptide having a 
deduced amino acid sequence corresponding to SEQ ID NO: 31 as follows: 
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MARRAAS RAVGSEG S I QGRGGRAGGNGAE DARH V FDELLRRGKGAT I YGLNRAL D D VARH S PAAAVS RY1SJRMARAG 
ADEVTPNLYTYSVLIGCCCRAGRLDLGFAALGNVIKKGFRVEAITFTPLLKGLCADKRTSDAMDIVLCRMTQLGCI 
PNVFSCTILLKGLCDENRSQEALELLQMMPDDGGDCPPDWLYNTVINGFFKEGDPDKAYATYHEMFDQGILPDW 
5 TYSSIIAALCKAQAMDKAMEVLNTMVKNGVMPNCRT^ 

DYLCKNRRCTEARKIFNSMTKCGLKPDITTYCTLLQGYATKGALVEMHDLLDLMVWNGIQPNHHVFNILICAYAKQ 
EKVDEAMLVFSKMRQQGLSPNAVNYRTVIDVLCKLGRVYDAVLTLKQMINEGLTPDIIVYTPLIHGFCTCDKWEKA 
EELIF 

10 [0059] Another suitable nucleic acid molecule in accordance with the present 

invention is isolated from rice and identified herein as Rhpr7, which has a nucleotide 
sequence of SEQ ID NO: 32, as follows: 

ATGGCACGCCGCGTCGCTGCCCGCGCCCGCGCCCGCGCCGGCGGCGTCCCGCGCTCGGAGGGTACGATCCAAGACC 

15 GAGCACGCGTTGGGAGCGGTGGCGCCGAGGACGCACTCGACGTGTTCGACGAATTGCTCCGGCGAGGCATCGGCGC 
TCCGATCCGCAGCTTGAACGGCGCTCTCGCCGACGTCGCGCGCGACAACCCCGCGGCCGCTGTGTCCCGCTTCAAC 
CGCATGGCACGAGCTGGTGCCAGCATGGTAACTCCCACCGTGCACACCTATGGCATCCTCATCGGCTGCTGCTGCA 
GTGCGGGCCGCTTAGACCTCGGTTTCGCGGCCTTGGGCCATGTCGTTAAGAAGGGATTCAGAGTGGAACCCATCAT 
CT T T AAT CCTCTGCT C AAGG G CCTCTGTG C AG AC AAG AG G ACG G AC GACGC AAT G G AC AT AG TG C T C CGT GG AAT G 

20 ACCGAGCTCAGCTGCGTGCCAAATGTCTTCTCCCACACCATTATTCTCAAGGGACTCTGTCATGAGAACAGAAGCC 
AAG AAG CT C T C GAG C T G C T C C AC AT GAT GG CT G AT GAT G G AGG AGG C T G CT T AC C T AAT G T T GT GT CAT AC AG C AC 
CGTCATCGATGGCCTCTTGAAAGGAGGGGATCCGGACAAAGCCTACGCTACATACCGTGAAATGCTTGACCGGAGG 
ATTTTGCCAAATGTTGTGATTTACAGCTCCATTATTGCTGCCCTATGCAAGGGTCAAGCAATGGACAAGGCCATGG 
AGGTACACGATAGGATGGTTAAGAATGGAGTTACACCCAATTGCTTCACGTATACTAGTCTTGTGCATGGATTTTG 

25 CTCTTCAGGGCAGTTGACAGAGGCTATTAAATTTCTAGAAAAGATGTGCAGCAATGGTGTTGAACCAAATGTTGTT 
ACT TAT AG C T C G T T TAT G G AC TAT C T CT G C AAG AAC GG AAG AT G C AC AG AAG C T AG AAAG AT T T T T GAT T CT AT G G 
T C AAG AG G G G C C T AAAG C CT G AT AT T ACT AC CT AC AGT AG CT T AC T T CAT G G G TAT G C TAT C G AAG GAG C T C T T G T 
TGAGATGCATGGTCTCTTTGATTTGATGGTACAAAGTGATATGCAACCCGATCATTATGTCTTCAACACACTAATA 
TATGCATCCGCCAAGCAAGGAAAAGTAGATGAGGCCATGCTTGTATTTAGCAAAATGAGGCAGCAAGGATTGAAAC 

30 CT AAT TGTGTTACG TAT AG C AC T T T GAT T AAT G G C T AC T G T AAAAT T AC T AGG ATG G AG AAT GC T T T AG C AC T T T T 
CCAAGAGATGGTGAGCAATGGTGTTAGTCCTAATTTTATCACATATAACATAATGCTGCAAGGTTTATTTCGTACA 
GGAAGAACTGCTACTGCAAAAGAATTCTATGTACAGATTATCAAAAGTGGCAAAAAAGATCTTATAGAACAGGGGT 
TGCTAGAAGAATTGGATGATCTATTTCTTTCAATGGAGGACAATGACTGTAGTACTGTGTCGACTCCTGCATGCTA 
A 

35 

Rhpr7 is a rice homolog of the Petunia R/-PPR592 gene. 

[0060] The nucleic acid molecule of the present invention which has the 

nucleotide sequence of SEQ ID NO: 32 encodes a protein or polypeptide having a 
deduced amino acid sequence corresponding to SEQ ID NO: 33 as follows: 



40 
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MARRVAARARARAGGVPRSEGTIQDRARVGSGGAEDALDVFDELLRRGIGAPIRSM 

R^1?^RAGASMVTPTVHTYGILIGCCCSAGRLDLGFAALGHWKKGFRVEPIIFNPLLKGLCADKRTDD7^MDIVLRGM 
TELSCVPNVFSHTIILKGLCHENRSQEALELLHMMADDGGGCLPNWSYSTVIDGLLKGGDPDKAYATYREMLDRR 
ILPNWIYSSIIAALCKGQAMDKAMEVHDRMVKNGVTPNCFTYTSLVHGFCSSGQLTEAIKFLEKMCSNGVEPNVV 
TYSSFMDYLCKNGRCTEARKIFDSMVKRGLKPDITTYSSLLHGYAIEGALVEMHGLFDLMVQSDMQPDHYVFNTLI 
YASAKQGKVDEAMLVFSKMRQQGLKPNCVTYSTLINGYCKITRMENALALFQEMVSNGVSPNFITYNIMLQGLFRT 
GRTATAKEFYVQI IKSGKKDLIEQGLLEELDDLFLSMEDNDCSTVSTPAC 

[0061] Another suitable nucleic acid molecule in accordance with the present 

invention is isolated from rice and identified herein as Rhpr8, which has a nucleotide 
sequence of SEQ ID NO: 34, as follows: 

ATGGCGCGCCGCGCCGCTTCCCGCGCTGCTGGCGCCCTTCGCTCGGAGGGCTCGATCCAAGGGCGAGGGGGCCGCG 
CGGGGGGCAGTGGCGGTGGCGCGGAGGACGCACGCCACGTGTTCGACGAATTGCTCCGTCGTGGCATACCAGATGT 
CTTCTCCTACAATATTCTTCTCAACGGGCTGTGTGATGAGAACAGAAGCCAAGAAGCTCTCGAGTTACTGCACATA 
ATGGCTGATGATGGAGGTGACTGCCCACCTGATGTGGTGTCGTACAGCACCGTCATCAATGGCTTCTTCAAGGAGG 
GGG AT C T GG AC AAAAT GC T T G AC C AG AG GAT TT C G C CAAAT G T T GT G AC C T AC AAC T CT AT T AT T G C T G C G C T AT G 
CAAGGCTCAAACTGTGGACAAGGCCATGGAGGTACTTACCACCATGGTTAAGAGTGGTGTCATGCCTGATTGCATG 
AC AT AT AAT AG T ATT GT G CAT GGG T TTTGCTCTT C AGGG C AG C C G AAAG AGG C TAT T GT AT T T CT C AAAAAG AT G C 
GCAGTGATGGTGTCGAACCAGATGTTGTTACTTATAACTCGCTCATGGATTATCTTTGCAAGAACGGAAGATGCAC 
GGAAGCAAGAAAGATTTTTGATTCTATGACCAAGAGGGGCCTAAAGCCTGATATTACTACCTATGGTACCCTGCTT 
CAGGGGTATGCTACCAAAGGAGCCCTTGTTGAGATGCATGGTCTCTTGGATTTGATGGTACGAAACGGTATCCACC 
CT AAT CAT TAT G T T T T C AG CAT T C T AGT AT GT G CAT AC G C T AAAC AAG AG AAAGT AG AAG AG G C AAT G C T T GT AT T 
CAGCAAAATGAGGCAGCAAGGATTGAATCCGAATGCAGTGACCTATGGAACAGTTATAGATGTACTTTGCAAGTCA 
GGTAGAGTAGAAGATGCTATGCTTTATTTTGAGCAGATGATCGATGAAGGACTAAGACCTGACAGCATTGTTTATA 
ACTCCCTAATTCATAGTCTCTGTATCTTTGACAAATGGGAGAAGGCTGAAGAGTTATTTCTTGAAATGTTGGATCG 
AGGCATCTGTCTTAGCACTATTTTCTTTAATTCAATAATTGACAGTCATTGCAAAGAAGGGAGGGTTATAGAATCT 
GGAAAACTCTTTGACTTGATGGTACGAATTGGTGTGAAGCCCGATATCATTACCCTTGGCAGGTTTTTGGGGAGCG 
CAAGGCGCGACTACTCACTGTTCGTCT^ACATCTACTTCATCTTCACCAACATGTCGAACACTGGAGACAAGGAGAA 
GGAGACTCCCGTCAACACCAACGGAGGCAATACTGCCTCAAACTCCAGCGGAGGACCATTCTTGGGCACATACAAC 
ATAATCCTTCATGGACTTTGCAAAAACAAACTCACTGATGATGCACTTCGAATGTTTCAGAACCTATGTTTGATGG 
ATTTGAAGCTTGAGGCTAGGACTTTCAACATTATGATTGATGCATTGCTTAAAGTTGGCAG7VAATGATGAAGCCAA 
GGATTTGTTTGTTGCTTTCTCGTCTAACGGTTTAGTGCCGAATTATTGGACGTACAGATTGATGGCTGAAAATATT 
ATAGGACAGGGGTTGCTAGAAGAATTGGATCAACTCTTTCTTTCAATGGAGGACAATGGCTGTACTGTTGACTCTG 
GCATGCTAAATTTCATTGTTAGGGAACTGTTGCAGAGAGGTGAGATAACCAGGGCTGGCACTTACCTTTCCATGAT 
TGATGAGAAGCACTTTTCCCTCGAAGCATCCACTGCTTCCTTGTTTATAGATCTTTTGTCTGGGGGAAAATATCAA 
G AAT AT CAT AT AT TTCTCCCT G AAAAAT AC AAG T C C TT TAT AG AAT C T T T G AGC T G C TG A 
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[0062] The nucleic acid molecule of the present invention which has the 

nucleotide sequence of SEQ ID NO: 34 encodes a protein or polypeptide having a 
deduced amino acid sequence corresponding to SEQ ID NO: 35 as follows: 

5 MARRAASRAAGALRSEGSIQGRGGRAGGSGGGAEDARHVFDELLRRGIPDVFSYNILLNGLCDENRSQEALELLHI 
l^DDGGDCPPDWSYSTVINGFFKEGDLDKMLDQRISPNWTYNSIIAALCKAQTVDKAMEVLTTMVKSGVMPDCM 
TYNSIVHGFCSSGQPKEAIVFLKKMRSDGVEPDVVTYNSLMDYLCKNGRCTEARKIFDSMTKRGLKPDITTYGTLL 
QGYATKGALVEMHGLLDLMVRNGIHPNHYVFSILVCAYAKQEKVEEAMLVFSKMRQQGLNPNAVTYGTVIDVLCKS 
GRVEDAMLYFEQMIDEGLRPDSIVYNSLIHSLCIFDKWEKAEELFLEMLDRGICLSTIFFNSIIDSHCKEGRVIES 
10 GKLFDLMVRIGVKPDIITLGRFLGSARRDYSLFVNIYFIFTNMSNTGDKEKETPVNTNGGNTASNSSGGPFLGTYN 
1 1 LHGLCKNKLT DDALRMFQNLCLMDLKLEART FN IMI DALLKVGRNDEAKDLFVAFS SNGLVPN YWT YRLMAEN I 
IGQGLLEELDQLFLSMEDNGCTVDSGMLNFIVRELLQRGEITRAGTYLSMIDEKHFSLEASTASLFIDLLSGGKYQ 
EYHIFLPEKYKSFIESLSC 

15 [0063] Another suitable nucleic acid molecule in accordance with the present 

invention is isolated from rice and identified herein as Rhpr9, which has a nucleotide 
sequence of SEQ ID NO: 36, as follows: 

ATGGCGAGAAGAGGACGACGATACTGTAGAGCAGAGGGAACAGAGGAGCGGGGCACTGGTGCTCCGGTGGCTGGGA 

20 GGTGGCGACGTCGGCGGCCGAATGTGTTCCCGAGCGCCGCGCTGGAGAGCCCCGAGCTGCGACGGCATCACGCCGA 
CTACCGGCCGTGGGCGGCGCACATGGAGGCAAAGCCGGTCTACTTCGCGTCGAGGCGTGCCTCCGGGCGGCCCGAG 
CTGCAGCAGCAGCTCGTCCGGCCCACCCCAATCTGGGCCGATTGGGCCGATCTCAGCCTCCCGGAGCGGAGACCGA 
TCTGGGCCGTCCATCCGCGCCGCCCAGCCAATCGGACGGTGGGTGTATTACTGTACTGCCAGGTCGGTGACCCTCC 
GCCGCCGGCGGCGGCGGCGGCGGCGGCAGGCATGGCGCGCCGTGTCACCACCCTTACCCGCGCCCGCACCCGCGCC 

25 CGCGGCGGCGGCGTCCCCAGCGCGCAGGGTGGTACGACCCAAGACCTAGGGCGCGCGGGGGGCAGTGGCACCGAGG 
GCGCACGCCACGTGCTCGACGAATTGCCGCTACGGGGCTGGGGCGCCTCGATCTACAGCTTCAACCGCACCCTCAC 
CGACGTCGCGCGTGACAGCCCAGCCGCAGCAGTTTCGCTCTTCAACCGCATGGCCCGAGCCGGCGCCGACGAGGTA 
ACTCCCGACTTGTGCACCTACAGCATTCTCATCGGTTGCTGCTGCCGCGCGGGCCGCTTGGACCTCGGTTTCGCGG 
CCTTGGGCAATGTCATTAAGAAGGGATTTAGAGTGGAAGCCATCACCTTCGCTCCTCTGCTCAAGGGCCTCTGTGC 

30 CGACAAGAGGACGAGCGACGCAATGGACATAGTGCTCCGCAGAATGACCGAGCTCAGCTGCATGCCAGATGTTTTC 
TCCTGCACCATTCTTCTCAAGGGTCTGTGTGATGAGAACAGAAGCCAAGAAGCTCTCGAGCTGCTGCACATGATGG 
CTGATGATCGAGGAGGAGGTAGCCCACCTGATGTGGTGTCGTATACCACTGTCATCAATGGCTTCTTCAAAGAGGG 
GGATTCAGACAAAGCTTACAGTACATACCATGAAATGCTTGATCGGAGGATTTCACCAAATGTTGTGACCTACAGC 
TCTATTATTGCTGCGTTATGCAAGGCTCAAGCTATGGACAAAGCCATGGAGGTACTTAACACCATGGTTAAGAATG 

35 GTGTCATGCCTGATTGCATGACATATAATAGTATTCTGCATGGATATTGCTCTTCAGGGCAGCCAAAAGAGGCTAT 
TGGAACACTCAAAAAGATGCGCAGTGATGGCGTCGAACCAAATGTTGTTACTTATAGATCACTGATGAATTATCTT 
T G C AAG AAT G G AAG AT G C AC C GAAG C T AG AAAG AT T T T C GAT T C T AT G AC C AAG AG GG G C C T AG AG C C TG AT AT T G 
CTACCTATCGTACCCTGCTTCAGGGGTATGCTACCAAAGGAGCCCTTGTTGAGATGCATGCTCTCTTGGATTTGAT 
GGATCCTGAGTTCTACAAGTATTTGGAGAAGTGA 

40 
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[0064] The nucleic acid molecule of the present invention which has the 

nucleotide sequence of SEQ ID NO: 36 encodes a protein or polypeptide having a 
deduced amino acid sequence corresponding to SEQ ID NO: 37 as follows: 

5 MARRGRRYCRAEGTEERGTGAPVAGRWRRRRPNVFPSAALESPELRRHHADYRPWAAHMEAKPVYFASRRASGRPE 
LQQQLVRPTPIWADWADLSLPERRPIWAVHPRRPANRTVGVLLYCQVGDPPPPAAAAAAAGMARRVTTLTRARTRA 
RGGGVPSAQGGTTQDLGRAGGSGTEGARHVLDELPLRGWGASIYSFNRTLTDVARDSPAAAVSLFNRMARAGADEV 
TPDLCTYSILIGCCCRAGRLDLGFAALGNVIKKGFRVEAITFAPLLKGLCADKRTSDAMDIVLRRMTELSCMPDVF 
SCTILLKGLCDENRSQEALELLHMMADDRGGGSPPDWSYTTVINGFFKEGDSDKAYSTYHEMLDRRISPNVVTYS 
10 SIIAALCKAQAMDKAMEVLNTMVKNGVM^ 

CKNGRCTEARKIFDSMTKRGLEPDIATYRTLLQGYATKGALVEMHALLDLMDPEFYKYLEK 

[0065] Another suitable nucleic acid molecule in accordance with the present 

invention is isolated from rice and identified herein as RhprlO, which has a nucleotide 
15 sequence of SEQ ID NO: 38, as follows: 

ATGCCCTTCCGACCGCGCCTCCCGCTCCCCCTCCTCCTCCTCCTCCTGCCTCACCTCCGGCGCCGCCGCTCCTCCC 
CCCGCCCGCCCGTGCCCGCGTGGAGACCGCTCTCGTATTATCCCTCGGCGGCGGCCGCGGCGGCGGAGGTGACGGA 
GTCCGAGGAGGACGCGGCGGCTGTTGGCAGGGACACCCGAGCCCCTCCCTCCATCGGCGGGATTGCACGGGGAGCG 

20 CCTAGGGTTGGCTGCAATGGCGGGGGGGCTGCCGATGACGAGGAGGTCGAGAGGAAGGCCCGCGCTGTCGCGCGGA 
TCAAGCTCTGCCATGAGCTTCTGCGGGAGAGGAGGTGGCGCGCGATGCGGGCAGCCTTGGCGCAGCTGGTGACTGA 
GCAAGGTGAGCATGCTATGAATTTTCCCCATTCTGATTATCAACTCTACTCATGTGGTATCTGAATAACTATGGTG 
AT T GGT GT G AGG AGG C G T AG G AAT G G CAT C GGT AG T T T T G AAC T T C T GAT C GAT AT G AAT G T G T G AC AC AGG AT AT 
AT TGTTTTTC C AG AG G CAT TAT C AAT TG AT CAT T AC C AT AT AAAAAAC AG T AAG AAAAGG GT C G AAAG C AAT G CAT 

25 ACATAGTTGTATTTGGTGTAGTATTATTACTGTAATTCGTTTTTTACTAGAAGGTCTCTGCAAGTATGACAAACTA 
GTAACATAAAAATTGTTCGCGTTTAATCTTATTGCGCTTCCTGCTGTAGGATCTGGGTCTGCAGCTGCTCTCTGTG 
AC AT C T T AT GG AAC AG AT T C AG AG AGTG T G AT T C C AAC G GT T G T G TAT G G G ATG C T C T AG C G AAC AG T T ATG CT AG 
AG C T C AG AT GG T T CAT GAT G C CCT T T AC G T T C T T AG T AAAAT GAG C AG C C T AAAC AT G C AAAT CTCGGTGTT C AC C 
TATGACAGTTTATTGCACGGCTTAAGGATGACAGACGTGGCATTGGAGCTTTTTGAAGAAATGGAGTCTTGTGGTG 

30 TCTCTCCCAGTGAATATTCGCATAGTATTATTATTAATGGCCTCTGTAAGCAAGATAAGGTTGGAGAAGCTTTATC 
TTTCCTTCAGGAAGCTAGGAAGGAGGGAAAGTTTAAACCCTTGGGAATGACCTTTAACATTCTTATGTCTGCATTG 
TGTAATTGGGGGTTTGTTCAGTCTGCAAAATCATTTTTATGCCTGATGCTGAAATATGGATTAGTCCCTGACAGGT 
ATACCTTTTCTACCCTTATACACGGTCTATGTAAAGTAGGTTCAATGGAGGAAGCATTGGATCTTTTCGAGAGAGT 
GACAAAAGAAGGAATGGAACTTGAGATTGTGACCTACAATAGCCTTATCAATGGGTACCGATTGCTTGGTTTAACA 

35 AAAGAAAT T C CT AAAAT CAT C C AG AT GAT GAG AG GC C AAGG T G T T G AAC C T GAT C T T GT T AC AT AT AC TAT ACT T A 
TTGCTGGTCACTGCGAAAGTGGTGATGTTGAAGAAGGAATGAAGGTAAGGAAGGATGTCCTAGACCAAGGTTTGCA 
GTTGAATATTGTCACATATAGTGTCCTTCTCAATGCTCTCTTCAAAAAAGGCATGTTCTGCGAAATTGACAACCTA 
CTCGGCGAGATCTACAATATTGGTTTGGATATGGATGTTATCGCATATTCCATCCTTATCCATGGGTATTGCAAGC 
T AGGGGAAAT T GAAAAGGCT CTT CAAGT AT GT AATG CAATGT GCAGTT CT CAGAGGGT AATGCC AACAT C ACTGAA 

40 CCATTTTTCTATTCTTCTAGGACTTTGCAAGAAAGGATTGTTAGTTGAAGCAAGGTGGTATTTGGAAAATGTAGCT 
AGAAAATATCAGCCAACTGATGTAGTGTTCTATAATGTCGTTATTGATGGTTATGCAAAACTTGGTGATATTGTAA 
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ATGCTGTTCGTTTGTATGATCAGATCACTGTAGCTGGTATGCACCCAACCATTGTCACATGCAATTCTCTTCTATA 
TGGGTATTGTAAAATTGGGGATCTGCAACTTGCCGAGAGCTATTTTAGGGCTATTCAGCTAAGTGGACTTCTACCA 
ACAGCAGTGACATACACTACCTTGATGGATGCACTCTCTGAAGCTGGAGAAGTTAATACCATGCTAAGTCTTTTTG 
AT G AAAT GG T T G C AAAG AG GAT C AAGG C AAAT G C AGT AAC T T AC AG T G T CAT T GT T AAAG G G CT T T GT AAG CAG C T 
5 CAG AT T T GAT GAG G C T AT C AAT GT T CT C AAAG AT AT G GAT AG C AAAGGT AT T AAT G C T G AC C C G AT AAC T T AC AAT 
ACCCTTATACAAGGTTTCTGTGAATCAGAAAACGTTCAGATGGCTTTCCACATACATGACATCATGTTATGCCGTG 
GCCTTGTGCCGACACCTGTTACTTATAACTTGCTTATTAATGTGCTGTGTTTGAAGGGAAAAGTTATTCAAGCAGA 
AAT ACT T T T G G AGT C C C T C AG AG AAAAT G G CAT T AAGT T G AG AAAAT T T G CGT AC AC AAC AC T TAT CAAAG CT CAG 
TGCGCAAAAGGAATGCCTATCAATGCTGTTTTGTTAGTTGGTAAGCTTCTAGATGCAGGATTTGAAGCTTCTATTG 
10 AAGATTTCAGTGCAGCAATCAATCGACTTTGCAAAAGACAATTTGCCAAAGAAGCCTTTATGTTTGTCCCGATTAT 
GCTATCTGTTGGTATTTACCCAGATACTCAAATATATTGTGTGCTAGGCAGAGCTCTGCAGAAAAATAGTGAGCTT 
GTCTATCTACCCATATTAAATGCACTTGCTGTTAAAACTGGTATTTAA 

RhprlO is a rice homolog of the Petunia R/-PPR592 gene. 
15 [0066] The nucleic acid molecule of the present invention which has the 

nucleotide sequence of SEQ ID NO: 38 encodes a protein or polypeptide having a 
deduced amino acid sequence corresponding to SEQ ID NO: 39 as follows: 

MPFRPRLPLPLLLLLLPHLRRRRSSPRPPVPAWRPLSYYPSAAAAAAEVTESEEDAAAVGRDTRAPPSIGGIARGA 
20 PRVGCNGGGAADDEEVERKARAVARIKLCHELLRERRWRAMRAALAQLVTEQGSGSAAALCDILWNRFRECDSNGC 
VWDALANSYARAQMVHDALYVLSKMSSLNMQISVFTYDSLLHGLRMTDVALELFEEMESCGVSPSEYSHSIIINGL 
CKQDKVGEALSFLQEARKEGKFKPLGMTFNILMSALCNWGFVQSAKSFLCLMLKYGLVPDRYTFSTLIHGLCKVGS 
MEEALDLFERVTKEGMELEIVTYNSLINGYRLLGLTKEIPKIIQMMRGQGVEPDLVTYTILIAGHCESGDVEEGMK 
VRKDVLDQGLQLNIVTYSVLLNALFKKGMFCEIDNLLGEIYNIGLDMDVIAYSILIHGYCKLGEIEKALQVCNAMC 
25 SSQRVMPTSLNHFSILLGLCKKGLLVEARWYLENVARKYQPTDWFYNWIDGYAKLGDIVNAVRLYDQITVAGMH 
PTIVTCNSLLYGYCKIGDLQLAESYFRAIQLSGLLPTAVTYTTLMDALSEAGEVNTMLSLFDEMVAKRIKANAVTY 
SVIVKGLCKQLRFDEAINVLKDMDSKGINADPITYNTLIQGFCESENVQMAFHIHDIMLCRGLVPTPVTYNLLINV 
LCLKGKVIQAEILLESLRENGIKLRKFAYTTLIKAQCAKGMPINAVLLVGKLLDAGFEASIEDFSAAINRLCKRQF 
AKEAFMFVPIMLSVGIYPDTQIYCVLGRALQKNSELVYLPILNALAVKTGI 

30 

[0067] Another suitable nucleic acid molecule in accordance with the present 

invention is isolated from Arabidopsis thaliana, which has a nucleotide sequence of 
SEQ ID NO: 40, as follows: 

35 ATGAAGGCTTTGAGATTGATTCAGCCTCATCTCTTGAAGACAGGTAGTCTTAGAACTGATTTGCTCTGTACCATTT 
CGAGTTTCTTTTCTAGCTGCGAACGAGACTTTTCAAGTATTAGCAATGGGAATGTCTGTTTCAGAGAGAGATTGAG 
AAGTGGTATTGTTGATATTAAGAAAGATGATGCTATTGCTCTGTTCCAAGAAATGATTAGGTCTCGTCCTCTTCCT 
AGTCTTGTTGATTTCAGTAGATTCTTTAGTGCCATTGCCAGAACAAAACAGTTCAATCTCGTGTTAGATTTCTGCA 
AGCAACTGGAATTGAATGGGATTGCTCATAACATCTACACTTTGAATATCATGATCAACTGCTTTTGCCGGTGTTG 

40 TAAAACTTGTTTTGCTTATTCTGTTTTGGGAAAAGTAATGAAGCTTGGGTATGAGCCTGACACAACCACGTTTAAC 
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ACTCTGATCAAAGGACTCTTTCTTGAGGGTAAAGTGTCTGAAGCTGTGGTTTTAGTCGATAGGATGGTGGAAAACG 
GATGTCAACCTGATGTGGTTACTTATAATTCGATTGTAAATGGGATATGTAGATCAGGAGATACTTCTTTGGCCTT 
GGATTTGCTCAGAAAGATGGAAGAAAGAAATGTTAAGGCTGATGTGTTTACTTACAGTACAATCATTGATAGTCTT 
T G T AG AGAT G GT T G CAT AG AC G CT G C AAT TAG C C T T T T C AAGG AAAT GG AG AC G AAAGG G AT T AAAT CT AG T GT T G 
5 T T AC GT AT AAT TCTCTTGT GAG AG G T CT T T GT AAAG C CGG T AAAT GG AAT GAT G GGG C AC T GT T GT T G AAG G AT AT 
GGTGAGTAGGGAAATCGTCCCTAATGTCATCACTTTCAATGTATTACTTGATGTTTTTGTCAAAGAAGGGAAGCTT 
C AGG AGGC T AAT G AAT T G T AC AAAG AG AT GAT C AC AAG AG G TAT AT C AC C T AAT AT T AT T AC T TAT AAT AC C T TG A 
TGGATGGGTATTGTATGCAGAACCGTCTTAGTGAGGCCAACAATATGTTGGATCTTATGGTTAGGAATAAGTGCAG 
TCCTGATATCGTGACTTTTACAAGTCTCATCAAAGGATATTGTATGGTGAAAAGAGTTGACGATGGTATGAAGGTC 

10 TTCCGCAATATTTCTAAGAGAGGCTTGGTTGCCAATGCAGTTACTTATAGCATTCTTGTCCAAGGGTTTTGTCAAT 
CCGGGAAAATAAAGCTCGCAGAGGAACTTTTCCAAGAAATGGTTTCACACGGTGTTCTTCCTGATGTTATGACGTA 
TGGTATTTTGCTTGATGGCTTGTGTGACAATGGGAAGCTTGAAAAGGCATTGGAAATTTTTGAGGATTTACAAAAG 
AGT AAG AT G GAT C T T G GT AT T GTT AT GT AT AC AAC CAT CAT CG AG GGG AT GT G C AAG GG T G G AAAAG T GG AAG AT G 
C C T GG AAT T TAT TCTGTAGC C T AC C T T GT AAAG G AGT G AAG CCT AAT G T TAT G AC AT AC AC CG T GAT GAT T T C AGG 

15 AT TAT GT AAG AAAGGGT C AC T GT C T G AAG C AAAC AT CT T G C TT AGAAAAAT GG AGG AAG AT G G G AAT G C G C C AAAT 
GATTGTACATACAACACACTAATCCGGGCACATCTCCGAGATGGTGACTTAACTGCATCAGCTAAACTTATTGAAG 
AAAT G AAG AGT TGTGGGTTCT C AG C AG AT G CT T C C AGT AT T AAG AT G GT T AT C G AT AT GT TAT T G AGT G G T G AAT T 
GGACAAAAGCTTTCTAGATATGCTTTCGTAA 

20 SEQ ID NO: 40 is a Arabidopsis homolog of the Petunia Rf-PPR592 gene. 

[0068] The nucleic acid molecule of the present invention which has the 

nucleotide sequence of SEQ ID NO: 40 encodes a protein or polypeptide having a 
deduced amino acid sequence corresponding to SEQ ID NO: 41 as follows: 

25 MKALRLIQPHLLKTGSLRTDLLCTISSFFSSCERDFSSISNGNVCFRERLRSGIVDIKKDDAIALFQEMIRSRPLP 
SLVDFSRFFSAIARTKQFNLVLDFCKQLELNGIAHNIYTLNIMINCFCRCCKTCFAYSVLGKVMKLGYEPDTTTFN 
TLIKGL FLEGKVSE AVVLVDRMVENGCQ PD WT YN S I VNG I CRSGDT S LALDLLRKMEERNVKADVFT YS T 1 1 DS L 
CRDGCIDAAISLFKEMETKGIKSSWTYNSLVRGLCKAGKWNDGALLLKDMVSREIVPNVITFNVLLDVFVKEGKL 
QEANELYKEMITRGISPNIITYNTLMDGYCMQNRLSEANNMLDLMVRNKCSPDIVTFTSLIKGYCMVKRVDDGMKV 

30 FRNISKRGLVANAVTYSILVQGFCQSGKIKLAEELFQE^4VSHGVLPDVMTYGILLDGLCDNGKLEKALEIFEDLQK 
SKMDLGIVMYTTIIEGMCKGGKVEDAWNLFCSLPCKGVKPNVMTYTVMISGLCKKGSLSEANILLRKMEEDGNAPN 
DCTYNTLIRAHLRDGDLTASAKLIEEMKSCGFSADASSIKMVIDMLLSGELDKSFLDMLS 

[0069] Another nucleic acid molecule in accordance with the present 

35 invention has a nucleotide sequence of SEQ ID NO: 42, identified herein as Rf- 
PPR591, as follows: 

1 AT AT AT AT AT AC AAAC T GAT TTTTTCTGTC TAT T T G C AC AGT GTTATTTT T AC AT ACC C T T G AAAAAG GG T AGC T C CGCT 

81 AAT AAT GT T ATC T T T ACAAAAAAT AAC AAT AC T T T T T T T AC AT AAT AT AT AC AAAAC T CAT T CT T AT G T AT T GT AAAT AT 
40 161 GATAAAAATATTGTTATTTTTTGTAATATAGCTATTAGGTAGTCATGTTGTGTAAATTTTCCTAAAAATATTTACCTGAG 
241 TCGGCCATTTGGCTAAAAATATTTCATTTTATAGTCGCATATACTCCAAGCTTGTATATCCCAGAGCGACAGTATACTTC 
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321 AATGGTATACGATATCTTCTATTATACACCTTTTTAGTATTCGTATACCCCCAATAGTATACAAGTTTGACACCGCAATA 

4 01 G T GT ACCC C AAT GTT G T G GT T G T GT GGC T AC AAAAT T T GAG T GAT G GT AG T G TAAT T T T T T AGT GT AAGC T GG G T AGT T T 

481 TGAAAACATTCTTTTTGAAAATTGTAGTTCAAGTCATAGTACAAAAAACTGAAATATTTATATGTTTCTTGATTCTGGCT 

561 GGTCTTCTAAAAATTTTGAAATGCTGGCTAGTTTTCATTTAGCGAGGGGCAAATAGGCTACATGGCCAAATTTTTACGTT 

5 641 AAAAGATAAGTGTTGTCTGGGAAAGTATTCGAAAAGATTGTAGGGACAAGTGTTGCCTAGACAACACGTCAAATTATGTA 

721 G AAAAAT G CAC G AAG AAATT C AAAAGC AAAT AT T GC T T AAGC AAG AGGC AG T C AAAGAC AAGTGC T GC C T T AGGG AGT G A 

801 GAAATGGGCATCACTATAAGATTGTATTTCCATCCGATATTTATTCATTATAAACTTAAGGAAAAGTGCAGGAAAAACCA 

881 CTAGTTTTTAGCTATTTTTTGGAACTTTAATCATTATGGGCTGAACTTCAGACTTTGTGGGCCGAACTTCATACATTCGC 

961 AAGTAAAAATTTAGCTCACAGGCCACTTTTACCACTAGTATTTGGTTTGAAGTCATTTTTTATTGGTTTTACATGAGAGA 

10 1041 CCACTTTTTGGAACTTCAATCTTTGTGCGCTTGAACTTCATGCCTAAGTTATTAAGTTCAACTTCAATCCGTAAGGGCTG 

1121 AAT TT T T AGG C AT AG AT GCGT AAAC T T C AACCT T G T GG AC T G AAGT T G AAC TTCGCCCCT T ATGGT GG C C T G AAGT T G AA 

1201 C T T C AAT C CT TGTGGGCT G AAC T T GT GT G AAG T T C AAC CC AC AAGG AT T AAAGT T T C AAAAAAT G ACC T C T C AAGC AAAA 

1281 TCTGTAAAAAAAAGTGGTCTCTCATGCACTTTTACCCATTCGCAAAGTAGGCTGAAGTTCAGCCCACAATTATTCAAGTT 

1361 CCAAAAAATTTCACAATATATACCTCCTTATCTCGGTTATGATCTTTTGTATGATTTAGCAAAATGGACGGGGAAAGTGC 

15 14 41 AC G AAAG ACC AC T T T T GC CAT T GGT C T T T G GGT AC AGG CC AC TAAT ACC AAAAT AT T T AGT T TGT G GC TACT T T T G C T T A 

1521 AAGAGTTGAACTTCAGTCCAGAGGCCGGATTGAAGTTCAGTCCTTAAAGATTGAACTTCGATCCAGTGCCATATGGACTG 

1601 AAGTTCAGTCAAGTCCTTAAGATGGAACTTCAGTCCAGAGCCATATGGACTGAAGTTCAGTCAATTATCAGAACTTAAGT 

1681 C AAT AT T TAT T T AGT AAAGG CC C AAAAG T G GT T AGT AT AAG ACC AAT AAAAAT AGC GGCC TAAAAC T AAAT AAC AG T G T T 

17 61 AAAAGTGGCTGATGGACGAAATTTCTACAAAATGGACTCGAGGTAGCAATTCAACTTCAACCTATGGTGTCATAGTTGTA 

20 1841 CAATTCTTCCAATCACCCCTACTAAGTGAAGTGAAGCGAAGATGATGAGAATTTCAGTGCGTTACTGTCTCAATGGTAAT 

1921 CCCTTTTTCTCATTCTTTGCTTATTCAATTGCACCCCGACATTATTCTACCAATACATGTTCCATTTCAGTTAAAGGGAA 

2001 TTTTGGGGTTTCTAATGAATTTCAGAATGTTAAGTGTTTAGATGATGCTTTCAGTTTGTTCCGTCAAATGGTTAGAACTA 

2081 AGCCTCTTCCTTCTGTTGCCTCTTTCTCTAAATTGTTGAAAGCTATGGTACATATGAAGCATTACTCTTCTGTTGTTTCT 

2161 CTTTTTCGAGAAATCCACAAATTACGAATTCCTGTTCATGAATTCATCTTGAGCATTGTGGTTAACAGTTGTTGCCTTAT 

25 22 41 GCATCGTACCGATCTCGGATTTTCTGTATTAGCCATTCACTTCAAGAAAGGCATTCCATATAATGAAGTCACCTTTACTA 

2321 CCTTAATAAGGGGACTTTTTGCTGAAAATAAGGTCAAAGATGCTGTTCATTTGTTCAAAAAGTTGGTGAGGGAGAATATA 

24 01 TGTGAGCCTAATGAAGTCATGTATGGAACGGTCATGAATGGGCTTTGCAAAAAGGGCCATACTCAAAAAGCTTTTGATTT 

2481 GCTCCGGTTAATGGAACAAGGAAGCACTAAGCCCAATACACGCACTTACACCATTGTCATAGACGCCTTTTGCAAAGATG 

2561 G G AT GC T AG AT G GTGC T ACC AG C C T T T T GAAT G AGAT G AAAC AAAAAAG CAT TC CT CCC G AC AT T T T T ACT TAT AG CAC T 

30 2 641 T TAAT T G ATG C T T TGT G T AAGT T AAG T C AG T GGG AAAAT G T T AG G AC T TTGTTCCTT GAG AT GAT AC AT CT TAAT AT T T A 

2721 TCCAAATGTGTGCACCTTCAACTCCGTCATTGATGGACTATGCAAAGAGGGGAAAGTAGAAGACGCTGAGGAAATAATGA 

2801 GAT AC AT GAT T G AAAAAG GT GT AG AC C C T G AT GT GAT CAC C TAT AAT AT GAT AAT T G AC GG AT AT GGC TTGCGTGGT C AA 

2881 G T GG AT AG AG C ACGGG AAAT T T T T GAT T CC AT GAT C AAT AAG AG CAT T G AGC CC GAT AT TAT T AGC TAT AAT AT AC TAAT 

2961 AAAT GG AT AT G C C AGG C AAAAG AAAAT AG ACG AG G C AAT G C AAG TCTGCCGT G AAAT T T C T C AAAAGGGAT T G AAAC C T A 

35 3041 G TAT T GTT AC C T GC AAT GTTCTCTTG CAT GGTCTTTTT G AAC T T GG AAG AAC TAAATCT G CAC AAAAT T T CT T T GAT GAG 

3121 ATGCTATCTGCGGGGCACATCCCTGATTTATACACTCATTGTACTTTGCTTGGTGGTTATTTTAAGAATGGACTTGTTGA 

3201 AG AG G C T ATGT CAC AC T T CC AT AAGT TGG AAAGAAG GAG AG AAG AT AC AAAT AT T C AAAT T T AC AC G G C T GT CAT T GAT G 

3281 GAT TG T GC AAAAATGG T AAGCT CG AC AAAGCT C AT G C T AC G T T T GAG AAG CTTCCCTT GAT AGG C T T AC AT CC T GAT G T G 

3361 ATAACATACACTGCAATGATTAGTGGATATTGTCAAGAAGGGTTGTTAGATGAAGCTAAAGATATGCTAAGGAAAATGGA 

40 34 41 GGACAATGGTTGTTTGGCAGACAACCGAACATACAATGTTATTGTGCGGGGATTTCTCAGAAGCAATAAAGTTAGTGAAA 

3521 TGAAGGCTTTTCTGGAGGAAATAGCTGGGAAGAGCTTCTCATTTGAGGCAGCTACTGTAGAGTTATTGATGGATATTATA 

3601 GCAGAGGATCCATCCATAACACGCAAAATGCACTGGATTAAACTGCACATTGCATGAATATACAAGGAGATTAGCAGAAT 

3 681 AAT CAC AG GT C C GT CC C AG AC AAC C C C AAG GC T AAAT C C CAC AAT C G AAAC AAG GT AAAC T AT T AAT AC TT T AACT GC C A 

37 61 AAACT T C T TT AAG AAC T ATGC C AAT T G AAACAGG T AAT AT AT AT AT T T C C T T T AT T T GG AAC AT T T C T C GAT T T AT GC GT 

45 3841 GTCATCCTTGTGCAGAGGCCATGCTAATCTTCTCTCAATCGTTCCCAGGTTTGATTTGAATGATATTTTAGATTATATAC 

3921 CCCACAGTTCTGCATTGAAATATGCACCCAAACAAATTAGTGCATCTGTCATAAAAGGGATTGCTCCTATTATACCATCA 

4001 T T AAG AAAAT C C T TGT G AC AGT C GG AT AAATG AG C AAAT AGT AC AT GT T T GT TT AT T T T T TAT T C AAT AAG AG T T T G AC A 
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4 081 TCTACGGGAAATTATAGTTATCTATGTGGTCGTACTTTTAAAGAAAAGTATTTTTGTGGTTATAGATTGACTGTTTTCCT 

4161 C T GT CAT T GATCG ACT T T CT T T TAT T C AC AT C AG AAGT AG GT AT AT GT G T AC AAT GCT T T T AAC AACT GGTT T G C ATG T G 

4241 CTCTTTATGGTCGCATTTCATCAACGAAGTCTCTTTGTCGAGATGCAGCTTGACTTGTTAAGACAAGAACCTTTCGATTT 

4321 G G AAAT GT GATT AT CC CAT T C AAAG AT AC T T G AC AG AT GT CT C AT G AT GC T TAT T T G AC T C AGAAG AAT ATT C AGAAAAG 

5 4 4 01 G CAT GT AG AT GT GC AG C AAATG AC AG AGT AT GT GCAAT GG GT G AGG AGG AC AAT T AT ACAT T GT C CT C CATT G T T C AG T G 

4481 GCAAATGGCAATACACCT ATGGGATCTAAAGGACATGTTCTGCATGTAGCTAGAAGGGATGCAAGTTCACAGGGAACTAG 

4561 GGATTTGGTAGACTATACCAGCCTTCATTTTATCAGTTAGTGAAATGAAAGAAACCATCAATGTAAAGGAAACTCTATGG 

4 641 TTGTACACCTTTTGAAGTTCCAAGTGTTAAACTAACCTCTGGTGTATATTAGTATATACGGTAGAATTCATTCAATTGCA 

4721 CAAGTAGATGTATCTTTTTGCTTGGTTTTAGTTCATTAAGGCATAAATGTTCTACTTAGGTTTCATGCATTAAAATGAAC 

10 4801 ATTCATTTGATCTATGATGATGGAGTCTTGGTCGTGCATATACATGCTTCAAAATTATTGTACAATGGGTTGTGTAGTCC 

4 881 AATATGTT7^ACATCATCCACGACATTATCTCTAATAGTTTGAGATTTTGTGATATTTATTCGTAAAATGCATGTTAAGAT 

4961 TATTGTAATTTAGACTTCTAAAGTTTTCTTTTTAGTTTGGTGGACAACAAAGTAATAAATCTCAAACACATTGTTTGGTC 

5041 TCTTATTCTTTGGAATAAAATATTGAGCTTTTTACAATGTGTACCCTTGGAATATAAAGTATTTACCTTACTATACCTAT 

5121 TAAAAATTACATTACTCATGAAATTCAAAGTATCTATCACACTGCGTATTTTTTTTTATACTATAAGTCTATATTTACCT 

15 5201 TAGGGCAAAAATTAGGCAAGTACTTACCCACATCGGGTGTATATACCCAATCAACAAAGGAATTTTTACACTCTATACCC 

5281 ATGAAATTTAAAGTATTTACAAGTCATACCCATTAAAAATTACTCTACCCATA7y\ACTAAi\AGTATTTACCCAACATACC 

5361 CAT T T T T TAT TG T AT ACC T T T GTT T AT AGGTGT AT AC CT T T AAG GC TGC AT AAAT T AT GGT AT AAAAG TCTGG AGG ACC A 

54 41 TTTATTATTGTTTACCTTTTTATACCTTATAATATATAAGTCTGGAGGGGTAGAGTGAGTAAATATTTTAATGGGTAGGC 

5521 C AGAT AAT AT T T GAAAGT AG T GGAT AGT GGC T GT AAT TAT T T TAG AT T T C GT GG GT AT T T T T GT AAGT GT CT AAT C AAC A 

20 5601 AATGCACGTCATTTGTTTACAATACACTACTATCACTTAGCCATAATTAATTAATAGACATTCTCTCTTCATTACATCAC 

5681 ATTACCATAGTTAATTGCTATGGTTAGGTATATATATCCGGTGTGAGTAAATTTTTTCATATAAATTATGGCAAGACGAG 

57 61 TAAATATGAAACTTACATGCAGAGGCAGATAAATATTTTGATTTTGATGAGTATTTTCGTAAAACAATGATTAAATTATC 

5841 GC G C AAAAC C CT T T C AGT T T GT TT T AAT CGT GT AC T TAT TTGTTTGT TAT CAT T CC AT AAT G AAAAT T AC CT C AT T AGT G 

5921 CCACATTATCTTTCTATAATGTGGCTATTGTGTCAAGAATCATAATCGTGCCAACTTGCTACATTGTAAAAACAATGATT 

25 6001 CTTTTGTGGCTATTTAGTCAAAAATAGTAACTCTGCTTTCCATTGTCTCCGGTCACCTCGGCCAACTCCGGCCCTACGTT 

6081 CATCAAGTACTTATTTTCCATATTTATTCGTTATTTTGTTAATACTTACAATTTGTTTAATTAAATCATAGAATTAGCTG 

6161 ATACACACATATATAGTGAAAAATGAGATAGTAACTGAAGCAGCTCAAGTTCAATTTTTAGGTGCAAAATTCTTCTATCA 

62 4 1 GTTATTATGTTTTGCTTTCAAATTAATAACATATTCATATAGCCGACCTCAACTAATTACGCATTGATGCATAGTTCATT 

6321 GTACTAGGAAAAGTAAAATTTCATTTTTAAGTTAGTTTATTTGAGCAAGTTATATATATATACACAATGCATGTGCTTAT 

30 64 01 ATCCCTTTCCAATGCTAACTCTGACTTCATGAAAATTAAATTATAGGTGTTACTTTAGTGAGGGACGCGAATTAATATTA 

64 81 CAT C AC T G GT AG T G GC GG AGCC AGT AT T T T T ACT AAGG AGT AT C AAAAT AT AAAT AAGT AAAT AT AC G AAAT AT T AAAAG 

6561 GATAGTGAATCTTCCTTTTTAATGTACTTCATTTTAAAGTTAGTTTATTTGAGAAAGTTATATATATACAATGTATGAAC 

6641 T GAT AT T C T T TG AT AAT GAT G ATGC C T AT GT G GAT AGT G AAT CT T C CT T T T T AAT GTT GAT G AAAAAT AAA 



35 R/-PPR591, isolated from Petunia, has an open reading frame ("ORF") of 1776 bp, 
extending between nucleotides 1882-3657, which is homologous to that of Rf- 
PPR592. 

[0070] The nucleotide sequence of SEQ ID NO: 42 encodes a protein or 

polypeptide having a deduced amino acid sequence of SEQ ID NO: 43, as follows: 

40 

MMRISVRYCLNGNPFFSFFAYSIAPRHYSTNTCSISVKGNFGVSNEFQNVKCLDDAFSLFRQMVRTKPLPSVASFS 
KLLKAMVHMKHYSSWSLFREIHKLRIPVHEFILSIWNSCCLMHRTDLGFSVLAIHFKKGIPYNEVTFTTLIRGL 
FAENECVKDAVHLFKKLVRENICEPNEVMYGTVMNGLCKKGHTQKAFDLLRLMEQGSTKPNTRTYTIVIDAFCKDGM 
LDGATSLLNEMKQKSIPPDIFTYSTLIDALCKLSQWENVRTLFLEMIHLNIYPNVCTFNSVIDGLCKEGKVEDAEE 
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IMRYMIEKGVDPDVITYNMIIDGYGLRGQVDRAREIFDSMINKSIEPDIISYNILINGYARQKKIDEAMQVCREIS 
QKGLKPSIVTCNVLLHGLFELGRTKSAQNFFDEMLSAGHIPDLYTHCTLLGGYFKNGLVEEAMSHFHKLERRREDT 
NIQIYTAVIDGLCKNGKLDKAHATFEKLPLIGLHPDVITYTAMISGYCQEGLLDEAKDMLRKMEDNGCLADNRTYN 
VIVRGFLRSNKVSEMKAFLEEIAGKSFSFEAATVELLMDIIAEDPSITRKMHWIKLHIA 

5 

[0071] Yet another nucleic acid molecule in accordance with the present 

invention has a nucleotide sequence of SEQ ID NO: 44, identified herein as rf- 
PPR592, as follows: 

10 ATGATGAGAATTGCAGTGCGTTACTGTCTCAATGGTAATCCCTTTTTCTCATTCTTTGCTTATTCAATTGCACCCC 
G AC AT TAT T C T AC C AAT AC AC GT T C CAT T T C AGT T AAAG G G AAT TTTGGGGTTTC T AAT G AAT T T GAG AAT GT T AA 
GTGTTTAGATGATGCTTTCAGTTTGTTCCGTCAAATGGTTAGAACTAAGCCTCTTCCTTCTGTTGTCTCTTTCTCT 
AAATTGTTGAAAGCTTTGGTACATATGAAGCATTACTCTTCTGTTGTTTCTCTTTTTCGAGAAATCCACAAATTAC 
GT ATT C C T GT T CAT G AAT T CAT CT T GAG CAT T GT G G T T AAC AGT T GT T G C CT TAT G CAT C GT AC CG AT CT C GG AT T 

15 TTCTGTATTAGCCATTCACTTCAAGAAAGGTATTCCATTTAATCAAGTTATCTTTAACACCTTACTAAGGGGACTC 
T T T G C T G AAAAT AAG GT T AAAG AT G C T GT T C ATT T G T T C AAAAAG T T G G T G AGG G AG AAT AT AT GT G AG C C T AAT G 
AAGTCATGTATGGAACGGTCATGAATGGGCTTTGCAAAAAGGGCCATACTCAAAAAGCTTTTGATTTGCTCCGGTT 
AATGGAACAAGGAAGTACTAAGCCCAATACATGTATCTATAGCATTGTTATCGATGCCTTTTGCAAAGATGGGATG 
CTAGATGGTGCTACCAGCCTTTTGAATGAGATGAAACAAAAAAGCATTCCTCCCGACATTTTTACTTATAGCACTT 

20 T AAT T GAT G C T T T GT GT AAGT T AAGT C AG T G GGAAAAT G T T AGG AC T TTGTTCCTT GAG AT GAT AC AT C T T AAT AT 
TTATCCAAATGTGTGCACCTTCAACTCCGTCATTGATGGACTATGCAAAGAGGGGAAAGTAGAAGACGCTGAGGAA 
AT AAT GAG AT AC AT GAT T G AAAAAGGT GT AG AC C C T GAT G T GAT C AC CT AT AAT AT GAT AAT T G AC GG AT AT G G C T 
TGCGTGGTCAAGTGGATAGAGCACGGGAAATTTTTGATTCCATGATCAATAAGAGCATTGAGCCC7VATATTATTAG 
CTATAATATACTAATAAATGGATATGCCAGGCAAAAGAAAATAGACGAGGC7VATGCAAGTCTGCCGTGAAATTTCT 

25 CAAAAGGGATTGAAACCTAGTATTGTTACCTGCAATGTTCTCTTGCATGGTCTTTTTGAACTTGGAAGAACTAAAT 
CTGCACAAAATTTCTTTGATGAGATGCTATCTGCGGGGCACATACCTGATTTATACACTCATTGTACTTTGCTTGG 
TGGTTATTTTAAGAATGGACTTGTTGAAGAGGCTATGTCACACTTCCATAAGTTGGAAAGAAGGAGAGAAGATACA 
AATATTCAAATTTACACGGCTGTCATTGATGGATTGTGCAAAAATGGTAAGCTCGACAAAGCTCATGCTACGTTTG 
AGAAGCTTCCCTTGATAGGCTTACATCCTGATGTGATAACATACACTGCAATGATTAGTGGATATTGTCAAGAAGG 

30 GTTGTTAGATGAAGCTAAAGATATGCTAAGGAAAATGGAGGACAATGGTTGTTTGGCAGACAACCGAACATACAAT 
GT T AT T GT G C GG GG AT T T C T C AG AAG C AAT AAAGT T AG T G AAAT G AAG GCTTTTCTG G AG G AAAT AG C T GG G AAG A 
GCTTCTCATTTGAGGCAGCTACTGTAGAGTTATTGATGGATATTATAGCAGAGGATCCTTCTTTGCTTAACATGAT 
TCCAGAATTTCACCGGGATAATAAGAAGTGA 

35 rf-PPR592 is a gene homologous to R/-PPR592 and is isolated from a non-restoring 
Petunia line. 

[0072] The nucleotide sequence of SEQ ID NO: 44 encodes a protein or 

polypeptide having a deduced amino acid sequence of SEQ ID NO: 45, as follows: 
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MMRIAVRYCLNGNPFFSFFAYSIAPRHYSTNTRSISVKGNFGVSNEFENVKCLDDAFSLFRQMVRTKPLPSWSFS 
KLLKALVHMKHYSSVVSLFREIHKLRIPVHEFILSIWNSCCLMHRTDLGFSVLAIHFKKGIPFNQVIFNTLLRGL 
FAENKVKDAVHLFKKLVRENICEPNEVMYGTVMNGLCKKGHTQKAFDLLRLMEQGSTKPNTCIYSIVIDAFCKDGM 
LDGATSLLNEMKQKSIPPDIFTYSTLIDALCKLSQWENVRTLFLEMIHLNIYPNVCTFNSVIDGLCKEGKVEDAEE 
5 IMRYMIEKGVDPDVITYNMIIDGYGLRGQVDRAREIFDSMINKSIEPNIISYNILINGYARQKKIDEAMQVCREIS 
QKGLKPSIVTCNVLLHGLFELGRTKSAQNFFDEMLSAGHIPDLYTHCTLLGGYFKNGLVEEAMSHFHKLERRREDT 
NIQIYTAVIDGLCKNGKLDKAHATFEKLPLIGLHPDVITYTAMISGYCQEGLLDEAKDMLRKMEDNGCLADNRTYN 
VIVRGFLRSNKVSEMKAFLEEIAGKSFSFEAATVELLMDIIAEDPSLLNMIPEFHRDNKK 

10 [0073] Also suitable in the present invention are other forms of the nucleic 

acid molecules shown above. An example of a nucleic acid suitable in the present 
invention is a nucleic acid molecule which hybridizes to a nucleotide sequence of 
from nucleotide 1982 to 3760 of SEQ ID NO: 1, SEQ ID NO: 19, SEQ ID NO: 20, 
SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, 

15 SEQ ID NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, or SEQ ID NO: 
40 under stringent conditions of a hybridization buffer comprising 20% formamide in 
0.9M saline/0.09M SSC buffer at a temperature of 42°C. 

[0074] For the purposes of defining the level of stringency, reference can 

conveniently be made to Sambrook et aL, Molecular Cloning: A Laboratory Manual, 

20 2nd Edition, Cold Spring Harbor, NY, Cold Spring Harbor Laboratory Press, at 1 1 .45 
(1989) which is hereby incorporated by reference in its entirety. An example of low 
stringency conditions is 4-6X SSC/0. 1-0.5% w/v SDS at 37°-45° C for 2-3 hours. 
Depending on the source and concentration of the nucleic acid involved in the 
hybridization, alternative conditions of stringency may be employed such as medium 

25 stringent conditions. Examples of medium stringent conditions include 1-4X 
SSC/0.25% w/v SDS at > 45° C for 2-3 hours. An example of high stringency 
conditions includes 0.1-1X SSC/0. 1% w/v SDS at 60 C for 1-3 hours. The skilled 
artisan is aware of various parameters which may be altered during hybridization and 
washing and which will either maintain or change the stringency conditions. For 

30 example, another stringent hybridization condition is hybridization at 4X SSC at 65° 
C, followed by a washing in 0. IX SSC at 65° C for about one hour. Alternatively, an 
exemplary stringent hybridization condition is in 50% formamide, 4XSSC, at 42° C. 
Still another example of stringent conditions include hybridization at 62° C in 6X 
SSC, .05X BLOTTO, and washing at 2X SSC, 0.1% SDS at 62° C. 
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[0075] The isolated nucleic acid molecule of the present invention can be 

from petunia, Arabidopsis thaliana, or rice. 

[0076] The present invention also relates to an isolated protein encoded by the 

isolated nucleic acid molecule of the present invention which restores fertility to 
cytoplasmic male sterile plants and modifies expression of toxic mitochondria 
proteins by the plant. 

[0077] The present invention also relates to an isolated expression system that 

contains the nucleic acid molecule of the present invention which restores fertility to 
cytoplasmic male sterile plants and modifies expression of toxic mitochondria 
proteins by the plant. This involves incorporating the nucleic acid molecules of the 
present invention into host cells using conventional recombinant DNA technology. 
Generally, this involves inserting the nucleic acid molecule into an expression system 
to which the nucleic acid molecule is heterologous (i.e., not normally present). The 
heterologous nucleic acid molecule is inserted into the expression system which 
includes the necessary elements for the transcription and translation of the inserted 
protein coding sequences. In one embodiment, the isolated expression system of the 
present invention contains the nucleic acid molecule of the present invention in proper 
sense orientation. 

[0078] The nucleic acid molecules of the present invention may be inserted 

into any of the many available expression vectors and cell systems using reagents that 
are well known in the art. Suitable vectors include, but are not limited to, the 
following viral vectors such as lambda vector system gtl 1, gt WES.tB, Charon 4, and 
plasmid vectors such as pBR322, pBR325, pACYC177, pACYC1084, pUC8, pUC9, 
pUC18 3 pUC19, pLG339, pR290, pKC37, pKClOl, SV 40, pBluescript II SK +/- or 
KS +/- (see "Stratagene Cloning Systems" Catalog (1993) from Stratagene, La Jolla, 
CA, which is hereby incorporated by reference in its entirety), pQE, pIH821, pGEX, 
pET series (see Studier et al., "Use of T7 RNA Polymerase to Direct Expression of 
Cloned Genes/' Gene Expression Technology, vol. 185 (1990), which is hereby 
incorporated by reference in its entirety), and any derivatives thereof. Recombinant 
molecules can be introduced into cells via transformation, particularly transduction, 
conjugation, mobilization, or electroporation. The DNA sequences are cloned into the 
vector using standard cloning procedures in the art, as described by Sambrook et al., 
Molecular Cloning: A Laboratory Manual 2nd Edition, Cold Spring Harbor, NY, 
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Cold Spring Harbor Laboratory Press (1989); and Ausubel et al., Current Protocols in 
Molecular Biology, John Wiley & Sons, New York, N.Y., (1989) which are hereby 
incorporated by reference in their entirety. 

[0079] In preparing a DNA vector for expression, the various DNA sequences 

5 may normally be inserted or substituted into a bacterial plasmid. Any convenient 
plasmid may be employed, which will be characterized by having a bacterial 
replication system, a marker which allows for selection in a bacterium and generally 
one or more unique, conveniently located restriction sites. Numerous plasmids, 
referred to as transformation vectors, are available for plant transformation. The 

10 selection of a vector will depend on the preferred transformation technique and target 
species for transformation. A variety of vectors are available for stable 
transformation using Agrobacterium tumefaciens, a soilborne bacterium that causes 
crown gall. Crown gall are characterized by tumors or galls that develop on the lower 
stem and main roots of the infected plant. These tumors are due to the transfer and 

15 incorporation of part of the bacterium plasmid DNA into the plant chromosomal 

DNA. This transfer DNA (T-DNA) is expressed along with the normal genes of the 
plant cell. The plasmid DNA, pTi or Ti-DNA, for "tumor inducing plasmid," 
contains the vir genes necessary for movement of the T-DNA into the plant. The T- 
DNA carries genes that encode proteins involved in the biosynthesis of plant 

20 regulatory factors, and bacterial nutrients (opines). The T-DNA is delimited by two 
25 bp imperfect direct repeat sequences called the "border sequences." By removing 
the oncogene and opine genes, and replacing them with a gene of interest, it is 
possible to transfer foreign DNA into the plant without the formation of tumors or the 
multiplication of Agrobacterium tumefaciens (Fraley et al., "Expression of Bacterial 

25 Genes in Plant Cells," Proc. Nafl Acad. Sci., 80:4803-4807 (1983), which is hereby 
incorporated by reference in its entirety). 

[0080] Further improvement of this technique led to the development of the 

binary vector system (Bevan, "Binary Agrobacterium Vectors for Plant 
Transformation," Nucleic Acids Res.. 12:871 1-8721 (1984), which is hereby 
30 incorporated by reference in its entirety). In this system, all the T-DNA sequences 
(including the borders) are removed from the pTi, and a second vector containing T- 
DNA is introduced into Agrobacterium tumefaciens. This second vector has the 
advantage of being replicable in E. coli as well as A. tumefaciens, and contains a 
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multiclonal site that facilitates the cloning of a transgene. An example of a commonly 
used vector is pBinl9 (Frisch et al., "Complete Sequence of the Binary Vector 
Binl9," Plant Molec. Biol., 27:405-409 (1995), which is hereby incorporated by 
reference in its entirety). Any appropriate vectors now known or later described for 
5 genetic transformation are suitable for use with the present invention. 

[0081] U.S. Patent No. 4,237,224 issued to Cohen and Boyer, which is hereby 

incorporated by reference in its entirety, describes the production of expression 
systems in the form of recombinant plasmids using restriction enzyme cleavage and 
ligation with DNA ligase. These recombinant plasmids are then introduced by means 

10 of transformation and replicated in unicellular cultures including prokaryotic 
organisms and eukaryotic cells grown in tissue culture. 
[0082] Certain "control elements" or "regulatory sequences" are also 

incorporated into the vector-construct. These include non-translated regions of the 
vector, promoters, and 5' and 3' untranslated regions which interact with host cellular 

15 proteins to carry out transcription and translation. Such elements may vary in their 
strength and specificity. Depending on the vector system and host utilized, any 
number of suitable transcription and translation elements, including constitutive and 
inducible promoters, may be used. 

[0083] A constitutive promoter is a promoter that directs expression of a gene 

20 throughout the development and life of an organism. Examples of some constitutive 
promoters that are widely used for inducing expression of transgenes include the 
nopoline synthase ("NOS") gene promoter, from Agrobacterium tumefaciens (U.S. 
Patent No. 5,034,322 issued to Rogers et al., which is hereby incorporated by 
reference in its entirety), the cauliflower mosaic virus ("CaMV") 35S and 19S 
25 promoters (U.S. Patent No. 5,352,605 issued to Fraley et al., which is hereby 

incorporated by reference in its entirety), those derived from any of the several actin 
genes, which are known to be expressed in most cells types (U.S. Patent No. 
6,002,068 issued to Privalle et al., which is hereby incorporated by reference in its 
entirety), and the ubiquitin promoter ('*ubi"), which is the promoter of a gene product 
30 known to accumulate in many cell types. 

[0084] An inducible promoter is a promoter that is capable of directly or 

indirectly activating transcription of one or more DNA sequences or genes in response 
to an inducer. In the absence of an inducer, the DNA sequences or genes will not be 
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transcribed. The inducer can be a chemical agent, such as a metabolite, growth 
regulator, herbicide or phenolic compound, or a physiological stress directly imposed 
upon the plant such as cold, heat, salt, toxins, or through the action of a pathogen or 
disease agent such as a virus or fungus. A plant cell containing an inducible promoter 
5 may be exposed to an inducer by externally applying the inducer to the cell or plant 
such as by spraying, watering, heating, or by exposure to the operative pathogen. An 
example of an appropriate inducible promoter for use in the present invention is a 
glucocorticoid-inducible promoter (Schena et al., " A Steroid-Inducible Gene 
Expression System for Plant Cells," Proa Natl. Acad. Sci.. 88:10421-5 (1991), which 

10 is hereby incorporated by reference in its entirety). Expression of the transgene- 

encoded protein is induced in the transformed plants when the transgenic plants are 
brought into contact with nanomolar concentrations of a glucocorticoid, or by contact 
with dexamethasone, a glucocorticoid analog (Schena et al., " A Steroid-Inducible 
Gene Expression System for Plant Cells," Proc. Natl. Acad. Sci. USA, 88:10421-5 

15 (1991); Aoyama et al., "A Glucocorticoid-Mediated Transcriptional Induction System 
in Transgenic Plants," Plant J., 1 1 : 605-612 (1997), and McNellis et al., 
"Glucocorticoid-Inducible Expression of a Bacterial Avirulence Gene in Transgenic 
Arabidopsis Induces Hypersensitive Cell Death, Plant J., 14(2):247-57 (1998), which 
are hereby incorporated by reference in their entirety). In addition, inducible 

20 promoters include promoters that function in a tissue specific manner to regulate the 
gene of interest within selected tissues of the plant. Examples of such tissue specific 
promoters include seed, flower, or root specific promoters as are well known in the 
field (U.S. Patent No. 5,750,385 issued to Shewmaker et al., which is hereby 
incorporated by reference in its entirety). In the preferred embodiment of the present 

25 invention, a heterologous promoter is linked to the nucleic acid of the construct, 

where "heterologous promoter" is defined as a promoter to which the nucleic acid of 
the construct is not linked in nature. 

[0085] The nucleic acid construct also includes an operable 3 ' regulatory 

region, selected from among those which are capable of providing correct 
30 transcription termination and polyadenylation of mRNA for expression in the host cell 
of choice, operably linked to a DNA molecule which encodes for a protein of choice. 
A number of 3' regulatory regions are known to be operable in plants. Exemplary 3' 
regulatory regions include, without limitation, the nopaline synthase ("nos") 3' 
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regulatory region (Fraley et aL, "Expression of Bacterial Genes in Plant Cells," Proc. 
Natl Acad. Sci. USA, 80:4803-4807 (1983), which is hereby incorporated by 
reference in its entirety) and the cauliflower mosaic virus ("CaMV") 3' regulatory 
region (Odell et aL, "Identification of DNA Sequences Required for Activity of the 
5 Cauliflower Mosaic Virus 35S Promoter ," Nature. 313(6005):810-812 (1985), which 
is hereby incorporated by reference in its entirety). Virtually any 3' regulatory region 
known to be operable in plants would suffice for proper expression of the coding 
sequence of the nucleic acid of the present invention. 

[0086] The vector of choice, suitable promoter, and an appropriate 3' 

10 regulatory region can be ligated together to produce the nucleic acid construct which 
contains the nucleic acid molecule of the present invention, or suitable fragments 
thereof, using well known molecular cloning techniques as described in Sambrook et 
aL, Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold Spring Harbor, NY, 
Cold Spring Harbor Laboratory Press (1989); Ausubel et aL, Current Protocols in 
15 Molecular Biology, John Wiley & Sons, New York, N.Y. (1989), which are hereby 
incorporated by reference in their entirety. 

[0087] Once the nucleic acid construct has been prepared, it is ready to be 

incorporated into a host cell. Accordingly, in another embodiment, the present 
invention is an isolated host cell containing the nucleic acid molecule of the present 

20 invention which restores fertility to cytoplasmic male sterile plants and modifies 
expression of toxic mitochondria proteins by the plant. Basically, this method is 
carried out by transforming a host cell with the expression system of the present 
invention under conditions effective to yield transcription of the nucleic acid molecule 
in the host cell, using standard cloning procedures known in the art, such as described 

25 by Sambrook et aL, Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold 
Spring Harbor, NY, Cold Spring Harbor Laboratory Press, (1989), which is hereby 
incorporated by reference in its entirety. Suitable host cells include, but are not 
limited to, bacteria, virus, yeast, mammalian cells, insect, plant, and the like. 
Preferably the host cells are either a bacterial cell or a plant cell. Methods of 

30 transformation may result in transient or stable expression of the DNA under control 
of the promoter. Preferably, the nucleic acid construct of the present invention is 
stably inserted into the genome of the recombinant plant cell as a result of the 
transformation, although transient expression can serve an important purpose, 
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particularly when the plant under investigation is slow-growing. Plant tissue suitable 
for transformation include leaf tissue, root tissue, meristems, zygotic and somatic 
embryos, callus, protoplasts, tassels, pollen, embryos, anthers, and the like. The 
means of transformation chosen is that most suited to the tissue to be transformed. 
5 [0088] An appropriate method of stably introducing the nucleic acid construct 

into plant cells is to infect a plant cell with Agrobacterium tumefaciens or 
Agrobacterium rhizogenes previously transformed with the nucleic acid construct. As 
described above, the Ti (or RI) plasmid of Agrobacterium enables the highly 
successful transfer of a foreign DNA into plant cells. Another approach to 

10 transforming plant cells with a gene which imparts resistance to pathogens is particle 
bombardment (also known as biolistic transformation) of the host cell, as disclosed in 
U.S. Patent Nos. 4,945,050, 5,036,006, and 5,100,792, all to Sanford et al., and in 
Emerschad et al., "Somatic Embryogenesis and Plant Development from Immature 
Zygotic Embryos of Seedless Grapes (Vitis vinifera)," Plant Cell Reports, 14:6-12 

15 (1995), which are hereby incorporated by reference in their entirety. Yet another 
method of introduction is fusion of protoplasts with other entities, either minicells, 
cells, lysosomes or other fusible lipid-surfaced bodies (Fraley et al., Proc. Natl. Acad. 
Sci. USA. 79:1859-63 (1982), which is hereby incorporated by reference in its 
entirety). The DNA molecule may also be introduced into the plant cells by 

20 electroporation (Fromm et al., Proc. Natl. Acad. Sci. USA, 82:5824 (1985), which is 
hereby incorporated by reference in its entirety). In this technique, plant protoplasts 
are electroporated in the presence of plasmids containing the expression cassette. 
Electrical impulses of high field strength reversibly permeabilize biomembranes 
allowing the introduction of the plasmids. Electroporated plant protoplasts reform the 

25 cell wall, divide, and regenerate. The precise method of transformation is not critical 
to the practice of the present invention. Any method that results in efficient 
transformation of the host cell of choice is appropriate for practicing the present 
invention. 

[0089] After transformation, the transformed plant cells must be regenerated. 

30 Plant regeneration from cultured protoplasts is described in Evans et al., Handbook of 
Plant Cell Cultures, Vol. 1, MacMillan Publishing Co., NY (1983); Vasil (ed.), Cell 
Culture and Somatic Cell Genetics of Plants. Acad. Press, Orlando, Vol. I, 1 984, and 
Vol. Ill (1986); and Fitch et al, "Somatic Embryogenesis and Plant Regeneration 
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from Immature Zygotic Embryos of Papaya (Carica papaya L.)," Plant Cell Rep.. 
9:320 (1990), which are hereby incorporated by reference in their entirety. 
[0090] Means for regeneration vary from species to species of plants, but 

generally a suspension of transformed protoplasts or a petri plate containing explants 
5 is first provided. Callus tissue is formed and shoots may be induced from callus and 
subsequently rooted. Alternatively, embryo formation can be induced in the callus 
tissue. These embryos germinate as natural embryos to form plants. The culture 
media will generally contain various amino acids and hormones, such as auxin and 
cytokinins. Efficient regeneration will depend on the medium, on the genotype, and 
10 on the history of the culture. If these three variables are controlled, then regeneration 
is usually reproducible and repeatable. 

[0091] Preferably, transformed cells are first identified using a selection 

marker simultaneously introduced into the host cells along with the nucleic acid 
construct of the present invention. Suitable selection markers include, without 

15 limitation, markers encoding for antibiotic resistance, such as the nptll gene which 
confers kanamycin resistance (Fraley et al, "Expression of Bacterial Genes in Plant 
Cells," Proc. Natl. Acad. Sci. USA. 80:4803-4807 (1983), which is hereby 
incorporated by reference in its entirety), and the genes which confer resistance to 
gentamycin, G418, hygromycin, streptomycin, spectinomycin, tetracycline, 

20 chloramphenicol, and the like. Cells or tissues are grown on a selection medium 
containing the appropriate antibiotic, whereby generally only those transformants 
expressing the antibiotic resistance marker continue to grow. Other types of markers 
are also suitable for inclusion in the expression cassette of the present invention. For 
example, a gene encoding for herbicide tolerance, such as tolerance to sulfonylurea is 

25 useful, or the dhfr gene, which confers resistance to methotrexate (Bourouis et aL, 
EMBO J., 2:1099-1 104 (1983), which is hereby incorporated by reference in its 
entirety). Similarly, "reporter genes," which encode for enzymes providing for 
production of an identifiable compound are suitable. The most widely used reporter 
gene for gene fusion experiments has been uidA, a gene from Escherichia coli that 

30 encodes the p -glucuronidase protein, also known as GUS (Jefferson et al., "GUS 

Fusions: p Glucuronidase as a Sensitive and Versatile Gene Fusion Marker in Higher 
Plants," EMBO J., 6:3901-3907 (1987), which is hereby incoiporated by reference in 
its entirety). Similarly, enzymes providing for production of a compound identifiable 
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by luminescence, such as luciferase, are useful. The selection marker employed will 
depend on the target species; for certain target species, different antibiotics, herbicide, 
or biosynthesis selection markers are preferred. 

[0092] Plant cells and tissues selected by means of an inhibitory agent or other 

5 selection marker are then tested for the acquisition of the transgene by Southern blot 
hybridization analysis, using a probe specific to the transgenes contained in the given 
cassette used for transformation (Sambrook et al., " Molecular Cloning: A Laboratory 
Manual " Cold Spring Harbor, NY, Cold Spring Harbor Laboratory Press (1989), 
which is hereby incorporated by reference in its entirety). 

10 [0093] After the expression cassette is stably incorporated in transgenic plants, 

it can be transferred to other plants by sexual crossing. Any of a number of standard 
breeding techniques can be used, depending upon the species to be crossed. Once 
transgenic plants of this type are produced, the plants themselves can be cultivated in 
accordance with conventional procedure so that the nucleic acid construct is present in 

15 the resulting plants. Alternatively, transgenic seeds or propagules (e.g., cuttings) are 
recovered from the transgenic plants. These seeds can then be planted in the soil and 
cultivated using conventional procedures to produce transgenic plants. 
[0094] Thus, in other embodiments, the present invention includes transgenic 

plants and seeds produced by transformation with the nucleic acid molecule of the 

20 present invention which restores fertility to cytoplasmic male sterile plants and 
modifies expression of toxic mitochondria proteins by the plant. Examples of 
transgenic plants include crop plants such as alfalfa, rice, wheat, barley, rye, cotton, 
sunflower, peanut, com, potato, sweet potato, bean, pea, chicory, lettuce, endive, 
cabbage, brussel sprout, beet, parsnip, turnip, cauliflower, broccoli, radish, spinach, 

25 onion, garlic, eggplant, pepper, celery, carrot, squash, pumpkin, zucchini, cucumber, 
apple, pear, melon, citrus, strawberry, grape, raspberry, pineapple, soybean, tobacco, 
tomato, sorghum, and sugarcane. Other examples of transgenic plants include 
ornamental plants such as Arabidopsis thaliana, Saintpaulia, petunia, pelargonium, 
poinsettia, chrysanthemum, carnation, and zinnia. 

30 [0095] The nucleic acid molecule of the present invention can be utilized to 

restore fertility to cytoplasmic male sterile plants for a wide variety of crop plants and 
ornamental plants. Thus, the present invention also relates to a method of restoring 
fertility to cytoplasmic male sterile plants involving transforming a cytoplasmic male 



WO 03/057859 



PCT/US03/00887 



-47- 

sterile plant with a nucleic acid molecule of the present invention under conditions 
effective to restore fertility to the cytoplasmic male sterile plant. The plant can have 2 
or more copies of the nucleic acid molecule. 

[0096] The nucleic acid molecule of the present invention can be utilized in a 

5 method for identifying genes affecting male fertility or mitochondrial gene expression 
in other species. The first PPR gene family member whose function was 
characterized is crpl, a gene involved in RNA processing in chloroplasts (Fisk et al., 
"Molecular Cloning of the Maize Gene crpl Reveals Similarity Between Regulators 
of Mitochondrial and Chloroplast Gene Expression," EMBO J., 18: 2621-2630 
10 (1999), which is hereby incorporated by reference in its entirety). R/-PPR592 and 
crpl exhibit some sequence similarity, though there are other PPR motif proteins in 
the databases with greater similarity to R/-PPR592. Both of these genes affect 
accumulation of particular RNA transcripts. 

[0097] Thus, another aspect of the present invention relates to a method of 

15 identifying a candidate gene restoring fertility in plants. The method involves 

analyzing the candidate gene for the presence of the above nucleic acid molecule in 
accordance with the present invention. 

[0098] Identification of the nucleic acid molecules of the present invention 

suggests new strategies for identification of restorers or nuclear male sterility (ms) 

20 alleles in crop species that are more important agriculturally than petunia. Thus, 

another aspect of the present invention relates to a method of identifying a candidate 
plant suitable for breeding with a cytoplasmic male sterile plant. The method 
involves analyzing the candidate plant for the presence, in its genome, of the above 
nucleic acid molecule of the present invention. 

25 [0099] For example, it is possible that a nucleic acid molecule of the present 

invention corresponds to a restorer allele in rice. Since the complete genome 
sequence of rice is publicly available, using the above-described method for 
identifying PPR motif-containing genes, candidates for rice restorer genes can be 
identified in the rice chromosomal region which is genetically linked to the rice 

30 restoration phenotype. Using standard methods of cloning and rice transformation, 
the candidate rice restorer gene can be introduced as a transgene into a rice CMS line 
and the fertility of the transformants can be evaluated to determine whether the PPR 
gene is actually a restorer gene. 
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[0100] The fact that the nucleic acid molecules of the present invention can be 

a PPR motif gene can also be used to identify putative genes in other species that 
might encode male sterility when disrupted. The homolog of a restorer gene in one 
species could, when mutated, be a male sterility-encoding gene in another species. 
5 Creating a male sterile line can be valuable for certain applications. For example, 
flowers of petunia and some other horticultural species undergo a phenomenon called 
pollination-induced senescence (Xu et al., "Programmed Cell Death During 
Pollination-Induced Petal Senescence in Petunia," Plant Phys., 122:1323-1333 (2000), 
which is hereby incorporated by reference in its entirety). Flowers are triggered to 

10 senesce when pollinated. A male sterile flower will last longer when the plant is male 
sterile, because no self pollen will be available to pollinate it. 
[0101] Since it is possible to introduce genes into yeast mitochondria, it is 

likely that methods for introducing genes into plant mitochondria can be developed. 
When it becomes possible to introduce the pcf gene or a toxic homolog containing the 

15 sequences on which the nucleic acid molecule of the present invention operates, then 
a new CMS/restorer system can be created in a different species. In such a system, a 
male sterile line is created by introducing pcf or a jocfhomolog into the mitochondrial 
genome. This CMS line can then be crossed with a line containing the nucleic acid 
molecule of the present invention in the nuclear genome, introduced by standard 

20 transformation methods, to create hybrid seed that will give rise to fertile progeny 
plants. 

[0102] Thus, the present invention also relates to a method of producing 

hybrid plant seed. The method first involves providing a cytoplasmic male sterile 
plant. Next, a second plant containing the above nucleic acid molecule in accordance 
25 with the present invention is provided. Finally, the cytoplasmic male sterile plant and 
the second plant are bred under conditions effective to produce hybrid progeny seed 
which yield fertile plants. 

[0103] Another aspect of the present invention relates to a method of 

producing plant seeds for an inbred line of plants. The method first involves 
30 providing a cytoplasmic male sterile plant. Next, a second plant containing the above 
nucleic acid molecule in accordance with the present invention is provided. Then, the 
cytoplasmic male sterile plant and the second plant are bred under conditions effective 
to produce hybrid progeny seed which yield fertile plants. Next, hybrid fertile plants 
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are produced from the hybrid progeny seeds Finally, the hybrid fertile plants and the 
second plant are backcrossed to produce seed which yield inbred progeny plants. 
[0104] Backcrossing methods can be used with the present invention to 

improve or introduce a characteristic into the present cultivar. The term backcrossing 
5 refers to the repeated crossing of a hybrid progeny back to one of the parental plants 
for that hybrid. The parental plant which contributes the locus for the desired 
characteristic is termed the nonrecurrent or donor parent. This terminology refers to 
the fact that the nonrecurrent parent is used one time in the backcross protocol and, 
therefore, does not recur. The parental plant to which the locus or loci from the 

10 nonrecurrent parent are transferred is known as the recurrent parent as it is used for 
several rounds in the backcrossing protocol (Poehlman et al., Breeding Field Crops, 
4th Ed., Ames, Iowa, Iowa State University Press, (1995); Fehr, ed., Principles of 
Cultivar Development, Vol. 1: Theory and Technique, NY, NY, Macmillan 
Publishing Company (1987); and Fehr, ed., Principles of Cultivar Development, Vol. 

15 2: Crop Species, NY, NY, Macmillan Publishing Company (1987), which are hereby 
incorporated by reference in their entirety). 

[0105] In a typical backcross protocol, the phenotypically and/or 

commercially appealing cultivar or accession (recurrent parent) is crossed with a 
second cultivar (nonrecurrent parent) that carries the single locus of interest (e.g., the 

20 GSB resistance gene locus) to be transferred. The resulting progeny from this cross 
are then crossed again to the recurrent parent and the process is repeated until a plant 
is obtained where essentially all of the desired morphological and physiological 
characteristics of the recurrent parent are recovered in the converted plant, in addition 
to the single transferred locus from the nonrecurrent parent. 

25 [0106] The selection of a suitable recurrent parent is an important step for a 

successful backcrossing procedure. The goal of a backcross protocol is to alter or 
substitute a single trait or characteristic in the original cultivar or accession. To 
accomplish this, a single locus of the recurrent cultivar is modified or substituted with 
the desired locus from the nonrecurrent parent, while retaining essentially all of the 

30 rest of the desired genetic, and therefore the desired physiological and morphological 
constitution of the original cultivar. The choice of the particular nonrecurrent parent 
will depend on the purpose of the backcross; one of the major purposes is to add some 
commercially desirable, agronomically important trait to the plant. The exact 
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backcrossing protocol will depend on the characteristic or trait being altered to 
determine an appropriate testing protocol. Although backcrossing methods are 
simplified when the characteristic being transferred is a dominant allele, a recessive 
allele may also be transferred. In this instance it may be necessary to introduce a test 
5 of the progeny to determine if the desired characteristic has been successfully 
transferred. 

[0107] A technique, known as modified backcrossing, uses different recurrent 

parents during the backcrossing. Modified backcrossing may be used to replace the 
original recurrent parent with a cultivar having certain more desirable characteristics 
10 or multiple parents may be used to obtain different desirable characteristics from 
each. 

[0108] Presently, it is possible to introduce genes into chloroplast genomes, 

which are maternally inherited in many species. Therefore, a CMS/restorer system 
can also be created by introducing the pcf 'locus, modified for chloroplast expression, 

15 into the chloroplast genome. The nucleic acid molecule of the present invention can 
be modified to replace the mitochondrial transit sequence with a chloroplast transit 
sequence, using standard methods (Kohler et ah, "Exchange of Protein Molecules 
Through Connections Between Higher Plant Plastids," Science, 276:2039-2042 
(1997), which is hereby incorporated by reference in its entirety), so that the protein 

20 encoded by the nucleic acid molecule of the present invention will turn off toxic gene 
expression in the chloroplast. Without the restorer, PCF in the chloroplast is likely to 
be toxic; PCF is toxic to E. coli bacteria. Thus, the present invention also relates to a 
method of producing plants with a cytoplasmic male sterile plant restoration system. 
The method first involves transforming a first plant in its chloroplast genome with a 

25 nucleic acid which causes the plant to become male sterile. Next, a second plant is 
transformed with the above nucleic acid molecule in accordance with the present 
invention whose protein product is targeted to the chloroplast. Finally, the first and 
second plants are crossed to produce progeny plants possessing a cytoplasmic male 
sterile plant restoration system. 

30 [0109] Since the nucleic acid molecule of the present invention prevents the 

expression of an organelle gene, it can be used to control the expression of a chimeric 
gene introduced into chloroplasts. If there is a useful protein that is desired to be 
produced from plant chloroplasts by introduction of a gene encoding the valuable 
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protein into the chloroplast genome, production of the valuable protein could 
deliberately be turned off by expressing the nucleic acid molecule of the present 
invention from a conditional promoter. When desired, the expression of the nucleic 
acid molecule of the present invention could be turned off, so that the valuable protein 
5 is produced. In further detail, the method of the present invention involves: (1) 

engineering a chimeric gene including the coding region of a desirable protein and the 
pcf gene sequences that are regulated by the nucleic acid molecule of the present 
invention; (2) introducing this chimeric gene into chloroplasts of a plant and obtaining 
a chloroplast transgenic line; (3) engineering the nucleic acid molecule of the present 

10 invention or its homologs so that the protein is targeted into chloroplasts; (4) 

introducing the engineered nucleic acid molecule of the present invention into the 
nuclear genome of a plant and obtaining a nuclear transgenic line; and (5) crossing 
plants to set up a regulated system. Alternatively, the plant in (2) can be made first 
and the gene in (3) introduced by transformation, or the plant in (4) can be made first 

15 and the gene in (2) introduced into the plant in (4). When it becomes possible to 
transform plant mitochondrial genomes, the chimeric gene containing the pcf 
sequence can be introduced into mitochondria and the product of the nucleic acid 
molecule of the present invention can be targeted to mitochondria to create the 
analogous system. 

20 [0110] The nucleic acid molecule of the present invention must be expressed 

in most of the plant, because in every tissue examined, the PCF protein is reduced in 
the presence of the nucleic acid molecule of the present invention (Nivison et al., 
"Identification of a Mitochondrial Protein Associated With Cytoplasmic Male 
Sterility in Petunia," Plant Cell 1:1121-30 (1989), Nivision et al., "Sequencing, 

25 Processing, and Localization of the Petunia CMS-Associated Mitochondrial Protein," 
Plant J., 5:613-623 (1994), which are hereby incorporated by reference in their 
entirety). The PCF protein does nevertheless vary in abundance (Conley et al., 
"Tissue-Specific Protein Expression in Plant Mitochondria," Plant Cell, 6:85-91 
(1994), which is hereby incorporated by reference in its entirety) and is more highly 

30 expressed in anthers. Very few promoters have been identified that confer expression 
in many tissues and in developing microspores at an early stage of pollen 
development. Thus, the promoter sequence of the nucleic acid molecule of the 
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present invention could be useful to express any of many different coding regions in a 
variety of tissues. 

[0111] It is also possible to dissect the regulatory sequences of the nucleic 

acid molecule of the present invention by standard methods to identify those regions 
that confer expression in particular tissue types. Typically, such regulatory sequences 
are 5' to the coding region, though the 3' flanking region can also be important. The 
most novel aspect of the regulatory sequences of the nucleic acid molecule of the 
present invention is that they confer expression in early microsporogenesis. Most of 
the published anther-specific promoters are not effective at the early stage of pollen 
development, when it is critical to restore proper mitochondrial function in plants 
carrying the CMS cytoplasm. For example, the promoter of a nucleic acid molecule 
of the present invention could be used with a different coding region to restore 
fertility to a species with a different CMS-encoding gene. Alternatively, the promoter 
could be used to control a gene toxic to pollen to confer male sterility, or regulatory 
elements from this promoter could be combined with those of another promoter to 
confer expression in early microsporogenesis. 

[0112] Thus, another aspect of the present invention relates to a method of 

directing gene expression to plant mitochondria. The method involves transforming a 
plant with a chimeric nucleic acid molecule containing a transgene operatively linked 
to a promoter or a terminator from a plant gene which restores fertility to cytoplasmic 
male sterile plants and modifies expression of toxic mitochondria proteins by the plant 
under conditions effective to direct expression of the transgene in the mitochondria of 
the transformed plant. The promoter has a nucleotide sequence of from nucleotide 1 
to 1981 of SEQ ID NO: 1. The terminator has a nucleotide sequence of from 
nucleotide 3761 to 4593 of SEQ ID NO: 1. 

[0113] The present invention also relates to a promoter from a plant gene 

which restores fertility to cytoplasmic male sterile plants and modifies expression of 
toxic mitochondria proteins by the plant under conditions effective to direct 
expression of the transgene in the mitochondria of the transformed plant. The 
promoter has a nucleotide sequence of from nucleotide 1 to 1981 of SEQ ID NO: 1. 
[0114] Another aspect of the present invention relates to a terminator from a 

plant gene which restores fertility to cytoplasmic male sterile plants and modifies 
expression of toxic mitochondria proteins by the plant under conditions effective to 



WO 03/057859 



PCT/US03/00887 



-53 - 

direct expression of the transgene in the mitochondria of the transformed plant. The 
terminator has a nucleotide sequence of from nucleotide 3761 to 4593 of SEQ ID NO: 
1. 

[0115] Another aspect of the present invention relates to a nucleic acid 

5 construct. The nucleic acid construct includes: (i) a promoter or a terminator from a 
plant gene which restores fertility to cytoplasmic male sterile plants and modifies 
expression of toxic mitochondria proteins by the plant under conditions effective to 
direct expression of the transgene in the mitochondria of the transformed plant and (ii) 
a nucleic acid heterologous to and operatively coupled to the promoter or the 
10 terminator. The promoter has a nucleotide sequence of from nucleotide 1 to 1981 of 
SEQ ID NO: 1 . The terminator has a nucleotide sequence of from nucleotide 3761 to 
4593 of SEQ ID NO: L 

[0116] Other embodiments of the present invention include isolated 

expression systems, host cells, transgenic plants, and transgenic plant seeds 

15 containing the nucleic acid construct of the present invention. 

[0117] Another aspect of the present invention is a method of expressing a 

gene preferentially in roots of a plant. The method involves transforming a plant with 
a nucleic acid construct containing a promoter suitable for driving expression 
preferentially in roots having a nucleotide sequence of from 1 to 1388 of SEQ ID NO: 

20 44; a nucleic acid heterologous to the promoter, where the promoter is operatively 
coupled 5' to the nucleic acid to induce transcription of the nucleic acid; and a 
terminator having a nucleotide sequence of from nucleotide 3 1 68 to 4016 of SEQ ID 
NO: 44, where the terminator is operably coupled 3' to the nucleic acid. This method 
can be used to express genes in roots of a plant, but not in stems, leaves, or buds. 

25 [0118] The nucleic acid molecule of the present invention or its homologues 

could also be used to deliberately alter floral morphology to produce novel flowers. 
Thus, in another embodiment, the present invention is a method of altering plant floral 
morphology in ornamental plants by transforming an ornamental plant with a nucleic 
acid molecule of the present invention which restores fertility to cytoplasmic male 

30 sterile plants and modifies expression of toxic mitochondria proteins by the plant. 
Combinations of certain nuclear genes with particular mitochondrial backgrounds 
have been found to result in altered floral morphology in some genera. For example, 
in tobacco, combinations of the nuclear genome of one species with the cytoplasm of 



WO 03/057859 



PCT/US03/00887 



-54- 

another sometimes results in very abnormal flowers, such as flowers in which anthers 
have been converted to petals. While these particular plants may not be desirable 
horticulturally, it is possible that the coding region or expression of the nucleic acid 
molecule of the present invention could be manipulated so that interesting, valuable 
5 floral alterations could be obtained. For example, flowers with a second set of petals 
in place of anthers could be attractive. A similar strategy could be pursued with other 
species in which novel floral morphology is desirable. Manipulation of the nucleic 
acid molecule of the present invention could occur, for example, by overexpressing it 
on a different promoter, changing the coding region, or using standard antisense or 

10 gene silencing methods to underexpress homologous genes. 

[0119] Another aspect of the present invention relates to a method of 

producing plants with a cytoplasmic male sterile plant restoration system. The 
method first involves mutagenizing a first plant having a nucleic acid which encodes a 
protein. The protein has a motif having an amino acid sequence corresponding to any 

15 of SEQ ID NOs: 3 to 18 or an amino acid sequence identified with a METAMEME 
software using the amino acid sequence of SEQ ID NO: 2 as input or an amino acid 
sequence identified as significantly similar to SEQ ID NO: 2 using a NCBI BLAST 
software (threshold^ E less than or equal to 15) with SEQ ID NO: 2 as input. Next, 
the mutagenized first plant is crossed with a wild-type plant having mitochondrial 

20 DNA polymorphisms compared to mitochondrial DNA in the mutagenized first plant 
to produce progeny plants. Finally, it is determined if the progeny plants are fertile, 
whereby fertile progeny plants can be used as a fertile maintainer line, where the 
mutagenized first plant, the fertile maintainer line, and a wild-type allele present in 
the first plant before mutagenesis comprises a new cytoplasmic male sterile plant 

25 restoration system. Figures 7A-B show this aspect of the present invention in detail. 
[0120] The Arabidopsis thaliana genome sequence contains a PPR gene 

highly similar to the nucleic acid molecule of the present invention. There are 
existing, publicly available insertional element collections that can be screened by 
standard methods to find a mutant in which the homolog of the nucleic acid molecule 

30 of the present invention is disrupted. The mutant can be examined to determine 

whether it encodes male sterility. Because the nucleic acid molecule of the present 
invention is important for nuclear/organelle interaction to produce male fertility, its 
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homologs in other species are likely to be essential for proper pollen development in 
those species. 

[0121] By mutating a PPR gene in a plant that does not have a CMS/restorer 

system, such a system can be created. In this strategy, a PPR gene is mutated and the 
5 plant becomes male sterile. The mutated PPR alleles are then crossed with plants 
carrying other cytoplasms, present in other varieties of the plant or in intercrossable 
species. If a cytoplasm can be found in which the plant is fertile in the presence of the 
two mutated PPR alleles, then a new CMS/restorer system will have been created. A 
line carrying the new cytoplasm plus the mutated alleles becomes the maintainer line. 
10 The line carrying the first cytoplasm plus the mutated alleles becomes rf/rf CMS. A 
line carrying an unmutated allele plus a mutated allele in the presence of the CMS 
cytoplasm becomes Rf/rf CMS. These lines can then be exploited just as standard 
maintainer, sterile, and restored lines are currently used in hybrid seed production 
(Figure 7). 

15 [0122] For example, tomato does not have a CMS/restorer system. It is 

known that markers near petunia Rf map to a region of the tomato genome where two 
nuclear male sterility alleles exist. Possibly, the tomato ortholog of the nucleic acid 
molecule of the present invention, when mutated, results in male sterility. If so, then 
the cytoplasms of the intercrossable wild tomato species can be tested to determine 

20 whether they can confer male fertility to a tomato line homozygous for the mutated 
PPR gene. 

[0123] The nucleic acid molecule of the present invention may not be usable 

directly to restore fertility to CMS lines of most other species. Current information 
indicates that different mitochondrial genes are present in different CMS lines. In 

25 most cases, restorer genes will have a specific mechanism of action — suppression of 
expression of the abnormal mitochondrial gene. However, by chance, there may be a 
few species that carry a CMS cytoplasm whose abnormality can be ameliorated by the 
nucleic acid molecule of the present invention. This can be determined by 
introducing the nucleic acid molecule of the present invention into the other species 

30 and determining whether the transgenic plants become male fertile. If so, the nucleic 
acid molecule of the present invention can be used as a fertility restorer for this 
species. 
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EXAMPLES 

[0124] The following examples are provided to illustrate embodiments of the 

present invention but are by no means intended to limit its scope. 



5 Example 1 - Identification of Two PPR-Containing ORFs as Potential 
Candidates for the Rf Gene 

[0125] Previously, the isolation of a 37.5-kb BIB AC clone, SB5, that 

cosegregates with the Rf gene has been reported (Bentolila et al., "Identification of a 

10 BIBAC Clone That Co-Segregates With the Petunia Restorer of Fertility (Rf) Gene/' 
Mol. Genet Genomics, 266:223-230 (2001), which is hereby incorporated by 
reference in its entirety). SB5 is part of a contig that was constructed by screening a 
Petunia BIBAC library with a marker, EACA/MCTC, tightly linked to Rf. No 
recombination was identified between EACA/MCTC and Rf after examining 

15 1,078 meiotic events. The genetic delimitation of the Rf locus was achieved only 

partially on the BIBAC contig. One extremity of the contig was separated from Rf by 
the occurrence of four recombination events, whereas no crossing-over was found 
between Rf and the other extremity (Bentolila et al., "Identification of a BIBAC 
Clone That Co-Segregates With the Petunia Restorer of Fertility (Rf) Gene," Mol. 

20 Genet. Genomics, 266:223-230 (2001), which is hereby incorporated by reference in 
its entirety). Because of the possibility that Rf might lie further away in the area not 
covered by the contig, a walk was initiated by screening the BIBAC library with a 
probe lying on the extremity that cosegregates with Rf Unfortunately, the only hits 
were clones already isolated in the contig, demonstrating the presence of a gap in the 

25 Petunia BIBAC library. 

[0126] Before increasing the redundancy of the library to find new clones 

covering the gap, it was determined whether the Rf gene might lie in the SB5 clone. 
Because the BIBAC vector is a binary vector allowing Agrobacterium-mediated plant 
transformation (Hamilton, "A Binary-BAC System for Plant Transformation With 

30 High-Molecular- Weight DNA," Gene, 200:107-1 16 (1997), which is hereby 

incorporated by reference in its entirety), SB5 was used to restore fertility to CMS 
plants. Unfortunately, although SB5 is stable in E. coli, it underwent multiple 
rearrangements when introduced into A. tumefaciens, thus precluding its use in 
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transgenic experiments. Randomly chosen clones of various sizes did not show this 
instability in A. tumefaciens, pointing to special features in the sequence of the SB5 
insert. 

[0127] To address whether Rf might lie in the SB5 clone, shotgun sequencing 

5 of the entire clone was carried out and the predicted ORFs for candidate Rf genes 
were examined. 

[0128] Because of difficulties in contig assembly caused by the presence of 

repeated sequences, BamHI subclones rather than the entire BIBAC clone were used 
as the starting material for shotgun sequencing. DNA was sonicated into 1- to 3-kb 

10 fragments, which were gel purified (Geneclean Spin Kit, Bio 101, Vista, CA), end- 
repaired with T4 DNA polymerase (GIBCO/BRL, Rockfield, MD) in the presence of 
all four dNTPs, and ligated at a mass ratio of 3 inserts to 1 vector into the Smal site of 
the pTrueBlue vector (Genomics One, Buffalo, NY). The ligation product was 
introduced into Electromax DH10B Escherichia coli cells (GIBCO/BRL), and DNA 

15 obtained from the white colonies by minipreparation was sequenced with the T7 
primer in the Cornell BioResource Center. The sequences were assembled into 
contigs with SEQUENCHER (Gene Codes, Ann Arbor, MI). 
[0129] ORFs from BIBAC SB5, their promoter region, and poly(A) signals 

were predicted by using GENSCAN (Burge et al., "Prediction of Complete Gene 

20 Structures in Human Genomic DNA," J. Mol. Biol., 268:78-94 (1 997), which is 
hereby incorporated by reference in its entirety) with the Arabidopsis parameter 
matrix. Duplicated blocks in the Rf locus were determined by aligning the genomic 
sequence against itself by using the dot-plot feature from the MEG ALIGN program 
(DNAstar, Madison, WI) with a 90% match. The presence of a transit peptide in the 

25 ORFs was determined by using PREDOTAR version 0.5 , TARGETP (Emanuelsson 
et al., "Predicting Subcellular Localization of Proteins Based on Their N-Terminal 
Amino Acid Sequence," J. Mol. Biol., 300:1005-1016 (2000); Nielsen et al., 
"Identification of Prokaryotic and Eukaryotic Signal Peptides and Prediction of Their 
Cleavage Sites," Prot. Eng., 10:1-6 (1997), which are hereby incorporated by 

30 reference in their entirety), and MITOPROT (Scharfe et al., Nucleic Acids Res., 
28:155-158 (2000), which is hereby incorporated by reference in its entirety). The 
length of the transit peptide was predicted by TARGETP and MITOPROT. 
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[0130] PPR motifs were identified in Rf-PPR592 and Rf-PPR591 by the 

MEME software (Bailey et al., in "Proceedings of the Second International 
Conference on Intelligent Systems for Molecular Biology," in Altaian, eds. (Am. 
Assoc. Artificial Intelligence Press, Menlo Park, CA), pp. 28-36 (1994), which is 
hereby incorporated by reference in its entirety). The parameters for motif searching 
were set as minimum width = 35, maximum width = 35. The PPR consensus motif 
computed from the comparison of 1,303 motifs has been described previously (Small 
et aL, "The PPR Motif - A TPR-Related Motif Prevalent in Plant Organellar 
Proteins," Trends Biochem. Sci., 25:46-47 (2000), which is hereby incorporated by 
reference in its entirety). 

[0131] Because the Rf gene is expected to be targeted to mitochondria where 

it can act upon the pcf gene to prevent its expression, an ORF predicted to carry 
mitochondrial transit sequences was searched. Two ORFs with putative 
mitochondrial targeting signals were identified. The two ORFs are adjacent to each 
other and appear to have originated from duplications in the promoter and coding 
region, but carry divergent 3 f flanking regions (Figure 1 A). The ORFs were 92% 
identical at the nucleotide level, and the predicted proteins were 93% similar, with C 
termini that differ completely in their final 12 aa. Both ORFs carry PPR motifs; one 
encodes 591 aa and the other encodes 592 aa, and were therefore named Rf-PPR591 
and Rf-PPR592. A third PPR-containing ORF might lie in the vicinity of the two 
PPR-containing ORFs shown in Figure 1 A. On the left extremity lies a genomic 
block that shares high similarity with the end of the coding sequence of Rf-PPR592 
and its terminator region. 

[0132] According to cleavage prediction programs, both putative proteins 

exhibited 28-residue mitochondrial transit peptides. Predicted transit peptides of Rf- 
PPR592 and Rf-PPR591 differed by only one substitution. To determine whether the 
predicted transit peptide could target a passenger protein to mitochondria, 44 codons 
from the 5 f end of the Rf-PPR592 coding region were inserted 5' to the coding region 
of an enhanced GFP. DNAs of this construct and of one known to target GFP to 
mitochondria were bombarded into onion epidermal cells. Both GFPs appeared to be 
localized to the same type of organelle in the single cells shown in Figures IB and C. 
Because the predicted transit peptides of Rf-PPR592 and Rf-PPR591 differed by only 
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one amino acid, it was expected that not only Rf-PPR592 but also Rf-PPR591 would 
be mitochondrial^ localized. 

[0133] Most of the predicted mature protein (87%) of Rf-PPR592 consisted of 

14 PPRs (Figure ID). These repeats extended from the amino acid in position 54 to 
5 the amino acid in position 544 and are organized in two sets of tandem repeats, one 
set containing 3 PPRs from amino acid 54 to amino acid 158, the other set containing 
1 1 PPRs from amino acid 160 to amino acid 544. Because the Rf-PPR591 and Rf- 
PPR592 proteins are 93% similar and differ mainly in the last 12 C-terminal amino 
acids, their organization with respect to PPRs is identical. There was a very good 

10 agreement between the consensus motif derived from the 14 PPRs found in Rf- 

PPR592 (hereafter designated 14 PPR consensus) and the consensus motif derived 
from 1,303 PPRs (hereafter designated 1303 PPR consensus) reported previously 
(Small et aL, "The PPR Motif - A TPR-Related Motif Prevalent in Plant Organellar 
Proteins," Trends Biochem. Sci., 25:46-47 (2000), which is hereby incorporated by 

15 reference in its entirety) (Figure ID). Whenever a discrepancy occured between the 
consensus motif of the 14 PPRs in Rf-PPR592 and the 1303 PPR consensus, the 
difference usually was a conservative substitution. For instance, the aspartic acid in 
the first position of the 14 PPR consensus is replaced by a glutamic acid in the 
1303 PPR consensus. Moreover, when the most frequent amino acid in the 14 PPR 

20 consensus at a given position differed from the corresponding amino acid found in the 
1303 PPR consensus, the amino acid in the 1303 consensus was generally the second 
most frequent in the 14 PPR consensus (glutamic acid at position 1, asparagine at 
position 18, alanine at position 28, tyrosine at position 29; Figure ID). 
[0134] It has been demonstrated that R/-PPR592, a gene encoding a 592-aa 

25 protein containing 14 PPRs, was able to restore fertility to CMS plants. The PPR 
motif, a degenerate 35-aa repeat, has been found in a very large gene family in the 
Arabidopsis genome (Small et aL, "The PPR Motif - A TPR-Related Motif Prevalent 
in Plant Organellar Proteins," Trends Biochem. Sci., 25:46-47 (2000), which is 
hereby incorporated by reference in its entirety). The repeats are organized in tandem 

30 arrays with the number of motifs per peptide ranging from 2 to 26. About two-thirds 
of these Arabidopsis PPR proteins are predicted to be targeted to either mitochondria 
or chloroplasts (Small et al., "The PPR Motif - A TPR-Related Motif Prevalent in 
Plant Organellar Proteins," Trends Biochem. Sci., 25:46-47 (2000), which is hereby 
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incorporated by reference in its entirety). Although distinct from the tetratricopeptide 
repeat (TPR), a motif that is likely to be involved in protein binding, the PPR motif 
shares with the former a predicted spatial structure consisting of two a-helices (Small 
et al., "The PPR Motif - A TPR-Related Motif Prevalent in Plant Organellar 

5 Proteins," Trends Biochem. Sci., 25:46-47 (2000); Das et al., "The Structure of the 
Tetratricopeptide Repeats of Protein Phosphatase 5: Implications for TPR-Mediated 
Protein-Protein Interactions;' EMBO J.. 17:1 192-1 199 (1998), which are hereby 
incorporated by reference in their entirety). Tandem PPRs are thought to form a 
superhelix with a central spiral groove that presumably serves as the ligand-binding 

10 surface in a similar way as the one predicted for the tandem TPRs (Small et aL, "The 
PPR Motif - A TPR-Related Motif Prevalent in Plant Organellar Proteins," Trends 
Biochem. Sci., 25:46-47 (2000), which is hereby incorporated by reference in its 
entirety). However, unlike in the TPR motif, the side chains lining the central groove 
of the PPR are almost exclusively hydrophilic, suggesting that some or all of the PPR 

15 motifs are RNA-binding rather than protein-binding motifs. This hypothesis is 

supported by the involvement in RNA metabolism and/or translation of the very few 
PPR motif-containing proteins characterized so far: maize chloroplast CRP1, involved 
in chloroplast petD RNA processing and petD and petA translation (Fisk et al., 
"Molecular Cloning of the Maize Gene Crpl Reveals Similarity Between Regulators 

20 of Mitochondrial and Chloroplast Gene Expression," EMBO J., 18:2621-2630 (1999), 
which is hereby incorporated by reference in its entirety), Chlamydomonas MCA1, 
required for the accumulation of the chloroplast petA transcript (Lown et al., 
"Chlamydomonas Nuclear Mutants That Fail to Assemble Respiratory or 
Photosynthetic Electron Transfer Complexes," Biochem. Soc. Trans., 29:452- 

25 455(2001), which is hereby incorporated by reference in its entirety), yeast PET309, 
required for the stability and translation of the coxl mitochondrial mRNA (Manthey et 
aL, "The Product of the Nuclear Gene PET309 is Required for Translation of Mature 
mRNA and Stability or Production of Intron-Containing RNAs Derived from the 
Mitochondrial COX1 Locus of Saccharomyces cerevisiae," EMBO J., 14:4031-4043 

30 (1995), which is hereby incorporated by reference in its entirety), and Drosophila 
BSF, which binds to and stabilizes the bicoid mRNA (Mancebo et aL, "BSF Binds 
Specifically to the bicoid mRNA 3 f Untranslated Region and Contributes to 
Stabilization of bicoid mRNA," Mol. Cell. Biol.. 21:3462-3471 (2001), which is 
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hereby incorporated by reference in its entirety). That Petunia i?/"belongs to this 
family is consistent with its similarity of action to crpl, meal, and pet309. Mutations 
in these three genes result in lack of accumulation of a particular transcript and 
reduced abundance of an organelle protein. Likewise, in Petunia restored plants, 
5 among the population of pcf transcripts with different 5' termini, the ones with termini 
at -121 exhibit reduced abundance and the amount of the PCF protein is greatly 
reduced (Pruitt et al., "Transcription of the Petunia mitochondrial CMS-Associated 
Pcf Locus in Male Serile and Fertility-Restored Lines," MoL Gen. Genet., 227:348- 
355 (1991); Nivison et al., "Identification of a Mitochondrial Protein Associated with 

10 Cytoplasmic Male Sterility in Petunia," Plant Cell 1:1 121-1 130 (1989), which are 
hereby incorporated by reference in their entirety). However, the alleles of the other 
PPR genes that are known to reduce RNA and/or protein accumulation are recessive, 
whereas the Petunia Rf allele is dominant. Rf genes from other species have been 
shown to alter the RNA transcript profile of the CMS-associated genes (Wise et al., 

15 "Mutator- Induced Mutations of the rfl Nuclear Fertility Restorer of T-Cytoplasm 
Maize Alter the Accumulation of T-urfl3 Mitochondrial Transcripts," Genetics, 
143:1383-1394 (1996); Singh et al., "Suppression of Cytoplasmic Male Sterility by 
Nuclear Genes Alters Expression of a Novel Mitochondrial Gene Region," Plant Cell, 
3:1349-1362 (1991); Tang et al., "Transcript Processing Internal to a Mitochondrial 

20 Open Reading Frame is Correlated with Fertility Restoration in Male-Sterile 

Sorghum," Plant J.. 10:123-133 (1996); Moneger et al., "Nuclear Restoration of 
Cytoplasmic Male Sterility in Sunflower is Associated with the Tissue-Specific 
Regulation of a Novel Mitochondrial Gene," EMBO J.. 13:8-17 (1994), which are 
hereby incorporated by reference in their entirety). In some cases, restoration has 

25 been shown to result from enhanced processing of the CMS-associated transcripts 

(Tang et al., "Transcript Processing Internal to a Mitochondrial Open Reading Frame 
is Correlated with Fertility Restoration in Male-Sterile Sorghum," Plant J., 10:123- 
133 (1996); Menassa et al., "Post-Transcriptional and Developmental Regulation of a 
CMS-Associated Mitochondrial Gene Region by a Nuclear Restorer Gene," Plant J., 

30 17:491-499 (1999), which are hereby incorporated by reference in their entirety). 

Taken together, these observations suggest that Rfs in other species could also be PPR- 
containing genes like the Petunia Rf. 
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[0135] The data presented here show that a pair of duplicated PPR-containing 

genes, denoted R/-PPR591 and R/-PPR592, lie in the Petunia i?/locus. A third related 
PPR gene might lie in the area not covered by the SB5 BIB AC clone as suggested by 
the high similarity between the sequence available at the end of the clone and the 
5 sequence present at the end of the coding sequence of R/-PPR592 and in its terminator 
region. 

[0136] In Brassica napus, the restorer locus has been shown to affect the 

transcripts of several mitochondrial genes, two of them being associated with the nap 
and pol CMS (Singh et ah, "Nuclear Genes Associated With a Single Brassica CMS 

10 Restorer Locus Influence Transcripts of Three Different Mitochondrial Gene 

Regions," Genetics, 143:505-516 (1996); Li et aL, "Restorer Genes for Different 
Forms of Brassica Cytoplasmic Male Sterility Map to a Single Nuclear Locus That 
Modifies Transcripts of Several Mitochondrial Genes," Proc. Natl. Acad. Sci. USA, 
95:10032-10037 (1998), which are hereby incorporated by reference in their entirety). 

1 5 At the same locus have been mapped Rjp, the restorer gene to the pol CMS, that 

modifies the transcripts of the pol CMS -associated orf224latp6 mitochondrial DNA 
region, Rfh, the restorer gene to the nap CMS that modifies the transcripts of the nap 
CMS-associated orf222/nad5c/orfl39 mitochondrial DNA region, and Mmt (modifier 
of mitochondrial transcripts), a gene that modifies the transcripts of the nad4 gene and 

20 another gene possibly involved in cytochrome c biogenesis (Li et aL, "Restorer Genes 
for Different Forms of Brassica Cytoplasmic Male Sterility Map to a Single Nuclear 
Locus That Modifies Transcripts of Several Mitochondrial Genes," Proc. Natl, Acad. 
Sci. USA. 95:10032-10037 (1998), which is hereby incorporated by reference in its 
entirety). The resolution of the genetic mapping in these studies did not allow the 

25 authors to address whether the three genes represent different alleles of a single gene 
or whether the restorer locus might contain multiple, related, tightly linked genes. A 
similar situation occurs in Sorghum, where at the R/3 locus, one of the two restorers to 
A3 CMS, has been mapped a gene that regulates the transcript-processing activity of 
A3 CMS-associated orfl07 and the Mmtl gene that enhances the transcript processing 

30 of urf209 (Tang et aL, "Cosegregation of Single Genes Associated with Fertility 

Restoration and Transcript Processing of Sorghum Mitochondrial orfl07 and ur/209" 
Genetics, 150:383-391 (1998), which is hereby incorporated by reference in its 



WO 03/057859 



PCT/US03/00887 



-63 - 

entirety). As in Brassica napus, either a multiallelic model or tightly linked genes 
could account for this result. 

[0137] It will be worthwhile to determine whether R/-PPR591 affects the 

profile of mitochondrial transcripts other than pcf'm transgenic plants. If so, it would 
5 strengthen the hypothesis that Rf alleles arise as modifications, perhaps through 

duplication, of existing alleles that control mitochondrial gene expression. According 
to this theory, once CMS occurs in a plant species, there maybe strong selective 
pressure for the plant to overcome it by recruiting preexisting activities and 
redirecting them to down-regulate the expression of CMS-encoding genes. 
10 Conceivably, recombination among closely related PPR-containing genes could have 
led to the appearance of the R/-PPR592 gene. 

Example 2 - A Deletion in the Promoter of rf-PPR592 Prevents Its Expression in 
CMS Floral Buds 

15 

[0138] If one of the candidate ORFs, Rf-PPR591 or Rf-PPR592, is the Rf 

gene, some sequence polymorphism between the allele of these ORFs found in a 
restorer line (RfTRf) and the allele found in a CMS plant (rf7rf) might be expected. 
Presumably some difference in the sequences of the dominant Rf allele vs. the 

20 recessive nonrestoring allele rf must reflect their opposite restoring ability. The 
sequence of rf-PPR592 was obtained by amplifying genomic DNA of a Petunia 
hybrida rf/rf plant, where rf was inherited from a P. hybrida line called 2423, with the 
PfuTurbo Hotstart DNA polymerase (Stratagene, La Jolla, CA) and PCR primers 
flanking Rf-PPR591 (S'-TGCACAGTGTTATATTTAC ATACCC-3 SEQ ID NO: 

25 46) and Rf-PPR592 (S'-TTTATG ATACATGGATTTCAACGAC-3 SEQ ID NO: 

47). A PCR product was obtained only with a primer specific to the 3' flanking region 
of Rf-PPR592, not with a primer specific to the 3' flanking region of Rf-PPR591 . The 
rf-PPR592 PCR product showed a reduction in size of about 500nt compared with the 
Rf-PPR592 PCR product amplified from the genomic DNA of an RfTRf line (Figure 

30 2A). Using the same primers, a PCR product similar in size to rf-PPR592 was 

amplified from another nonrestoring P. hybrida line as well as from a nonrestoring 
Petunia parodii line. The rf-PPR592 PCR product amplified from the P. hybrida 
2423 sequence was cloned into the pCR-Blunt II-TOPO vector (Invitrogen, Carlsbad, 
CA) and sequenced, revealing a gene 97% identical to Rf-PPR591 and 94% identical 
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to Rf-PPR592 in the coding region, with the predicted proteins 98% and 94% similar, 
respectively. Because of the primer design, the rf-PPR592 sequence lacks 35 nt 
available for Rf-PPR592. Similarity blocks between rf-PPR592, Rf-PPR592, and Rf- 
PPR591 were determined by comparing the aligned sequences with SEQUENCHER. 
5 Percent similarity was computed by using the MEGALIGN program. Comparison of 
the similarities of regions of the three different PPR genes revealed that the 5 f 
promoter region of rf-PPR592 is most similar to Rf-PPR591, whereas the 3 f flanking 
region of rf-PPR592 is most similar to Rf-PPR592. The genomic structure of rf- 
PPR592 was consistent with the past occurrence of recombination between two genes 

10 similar to Rf-PPR591 and Rf-PPR592 (Figure 2B). Because PCR amplification could 
have resulted in an artificial recombination between Rf-PPR591 and Rf-PPR592 due 
to their high similarity, the Rf-PPR592 PCR product was resequenced as a control 
experiment. The sequences of three rf-PPR592 and Rf-PPR592 clones were 
determined. No evidence of recombination was found in any of the sequenced Rf- 

15 PPR592 clones, thus precluding PCR amplification as the source of the genetic 
mosaic found in the rf-PPR592 ORF. 

[0139] rf-PPR592 carries a 530-nt deletion from -556 to -27 relative to the 

start codon of Rf-PPR592. This deletion is responsible for the observed difference in 
the sizes of the respective amplicons. Rf-PPR591 has a 49-nt gap within the same 

20 region, from -273 to -224 relative to the start codon of Rf-PPR592 (Figure 2B). 

[0140] RT-PCR experiments were performed to determine whether both Rf- 

PPR592 and rf-PPR592 are expressed in Petunia floral buds. The RT reaction was 
performed with Superscript II RNase H- reverse transcriptase (GIBCO), and the PCR 
was performed with the PfuTurbo Hotstart DNA polymerase. The reverse primer R3 

25 used for reverse transcription (RT)-PCR lies in the 3 f untranslated region of the Rf- 
PPR592 gene at position +430 to +454. The forward primer used for the PCR lies in 
the coding sequence and is specific to the rf or Rf allele, F2S or F2, respectively, 
because of DNA polymorphisms between rf and Rf in this area. Primer pairs F2SR3 
amplified a 1333-bp product and F2R3 amplified a 1507-bp product. R3, 5'- 

30 TG AAAATGAC AATCGTAAC AGAAAA-3 1 (SEQ ID NO: 48); F2, 5'- 
AAC ATTCCTCC AGACATT ATTAC A-3 1 (SEQ ID NO: 49); F2S, 5'- 
GACGCTGAGGAAATAATGAGATAC-3 1 (SEQ ID NO: 50). 
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[0141] An Rf-PPR592 transcript was detected in floral buds in lines carrying 

the Rf allele, but no transcripts of rf-PPR592 were detected in a homozygous 
nonrestoring rf/rf line (Figure 3A). The absence of the upstream 530-nt region in rf- 
PPR592 is likely to prevent the expression of PPR592 in the floral buds of 
5 nonrestoring lines. 

[0142] Since rf-PPR592 encodes a protein that is very similar to the one 

encoded by Rf-PPR592, a survey of its expression was conducted in tissues other than 
the floral buds. From all of the tissues analyzed, an rf-PPR592 transcript was detected 
only in roots of a nonrestoring rf rf line (Figure 3B). 

10 [0143] A deletion of 530 nt in the promoter area of the rf-PPR592 gene is the 

likely cause of its nonexpression in the floral buds of CMS plants. That the rf- 
PPR592 gene, which encodes a protein 98% similar to Rf-PPR591 and 94% similar to 
Rf-PPR592, has not yet accumulated missense mutations suggests either a recent 
deletion in the promoter or a functional expression in plant organs other than the floral 

15 buds. This latter possibility was supported by the finding of an rf-PPR592 transcript 
in the roots of homozygous nonrestoring rflrfline. 

[0144] Sequence inspection demonstrated that a recombination event between 

two genes similar to R/-PPR591 and R/-PPR592 can explain the formation of rf- 
PPR592. Perhaps once R/-PPR592 was generated and happened to prevent the 

20 expression of pcf, its maintenance required the presence of the CMS-associated gene. 
The absence of the CMS-associated gene in new nucleocytoplasmic combinations 
might have resulted in recombination between R/-PPR591 and R/-PPR592 because of 
their high similarity. In Brassica and related genera, Rfn is found only in association 
with the nap cytoplasm, suggesting that the evolutionary appearance of the nap 

25 cytoplasm and the attending male sterility may have provided the selective pressure 
for the origin, and possibly the continued presence, of Rfn inB, napus (Li et al., 
"Restorer Genes for Different Forms of Brassica Cytoplasmic Male Sterility Map to a 
Single Nuclear Locus That Modifies Transcripts of Several Mitochondrial Genes," 
Proc. Natl. Acad. Sci. USA, 95:10032-10037 (1998), which is hereby incorporated by 

30 reference in its entirety). Sampling of more rf-PPR592 genes from different Petunia 
species should help in understanding the evolution of CMS and fertility restoration in 
this genus. 



WO 03/057859 



PCT/US03/00887 



-66- 

Example 3 -Rf-PPR592 Is Able to Restore Fertility to CMS Plants 

[0145] A sequence encoding the N-terminal 44 aa of Rf-PPR592 was inserted 

5' to the green fluorescent protein (GFP) sequence in the pOL vector (Peeters et al. 5 
5 "Duplication and Quadruplication of Arabidopsis Thaliana Cysteinyl- and 

Asparaginyl-tRNA Synthetase Genes of Organellar Origin," J. Mol. EvoU 50:413- 
423 (2000), which is hereby incorporated by reference in its entirety) to use in 
transient assay of protein localization. As a control, a vector carrying GFP fused with 
a known mitochondrial coxIV transit peptide (Akashi et al. 5 "Potential Dual Targeting 

10 of an Arabidopsis Archaebacterial-Like Histidyl-Trna Synthetase to Mitochondria and 
Chloroplasts," FEBS Lett., 43 1 :39-44 (1 998), which is hereby incorporated by 
reference in its entirety) was also used in the transient assays. DNAs of GFP 
constructs were bombarded into onion epidermal cells as described in Scott et al., 
"Model System For Plant Cell Biology: GFP Imaging In Living Onion Epidermal 

15 Cells," BioTechniques, 26:1 125, 1 128-1 132 (1999), which is hereby incorporated by 
reference in its entirety. 

[0146] For the stable transformation experiments, genomic DNA from the Rf- 

PPR592 gene was amplified from the SB5 BIBAC clone with the PfuTurbo Hotstart 
DNA polymerase and the primers Fl 1-XbaI (5'- 

20 TCTAGAAAAAATGAAGGGGGAATCAAT-3'; SEQ ID NO: 51) and Rll-EcoRI 
(5 f -GAATTCACTTTGCTCTCACGATAAACTAAGA-3 f ;SEQ ID NO: 52) 
(underlined are the restriction sites added to the 5* end of the primers for further use in 
the cloning of the PCR product). The PCR product was first cloned into the pCR- 
Blunt II-TOPO vector, and its sequence was checked to be free of possible mutations 

25 generated by the polymerase. The PCR product was then released from the pCR- 

Blunt II-TOPO vector by digestion with Xbal and EcoRI, gel purified, and cloned into 
Xbal/EcoRI-digested binary vector pGPTVKan (Becker et al., "New Plant Binary 
Vectors With Selectable Markers Located Proximal to the Left T-DNA Border," Plant 
MoL Biol., 20:1 195-1 197 29 (1992), which is hereby incorporated by reference in its 

30 entirety). Petunia transformation and regeneration were performed as described in 
Horsch et al., "A Simple and General-Method for Transferring Genes Into Plants," 
Science, 227:1229-1231 30 (1985), which is hereby incorporated by reference in its 
entirety. Transformants were selected on 300 mg/liter kanamycin, 100 mg/liter 
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ticarcillin/clavulanic acid (15:1, Duchefa Biochemie, Harlem, The Netherlands). 
Shoots were rooted on N13 medium (O'Connell et al., "Somatic Hybridization 
Between Lycopersicon-Esculentum and Lycopersicon-Pennellii," Theor. Appl. 
Genet., 70:1-12 (1985), which is hereby incorporated by reference in its entirety) 
5 before transfer to soil. 

[0147] To determine whether Rf-PPR592 could restore fertility to rffrf CMS 

lines, a 4.6-kb fragment carrying the entire coding region was introduced into the 
binary vector pGPTVKan. This fragment carries 2007 nt upstream of the start codon 
and 861 nt downstream of the stop codon. The pGPTVKan-4.6 kb Rf-PPR592 vector 

10 was transferred into A. tumefaciens strain LBA4404, which was used to transform a 
P. parodii rf/rf CMS line (Figure 4A) and a P. hybrida rf/rf CMS line (Figure 4C). 
More than two dozen independent transformants were obtained and grown to 
flowering. Fertile transformants were observed after transformation of both lines 
(Figures 4B and D). Among these were several fertile transformants carrying a single 

15 copy of the introduced Rf-PPR592 genomic DNA. Flowers of one of the P. parodii 
primary transformant plants were selfed, and a population of 40 Tl progeny was 
grown to flowering. 

[0148] DNA extractions and Southern blotting were performed as described in 

Bentolila et al., "Locating the Petunia Rf Gene on a 650 kb DNA Fragment," Theor. 

20 Appl. Genet., 96:980-988 (1998), which is hereby incorporated by reference in its 
entirety. Floral bud protein was prepared for cell culture protein as described in 
Kohler et al., "The Green Fluorescent Protein as a Marker to Visualize Plant 
Mitochondria in vivo," Plant Journal, 1 1:613-621 (1997), which is hereby 
incorporated by reference in its entirety. After separation by SDS/PAGE (15%), 

25 immunoblots on Hybond-P poly(vinylidene difluoride) membranes (PVDF; 

Amersham Pharmacia, Picataway, NJ) were prepared as previously described (Reed et 
al., "High-Level Expression of a Synthetic Red-Shifted GFP Coding Region 
Incorporated into Transgenic Chloroplasts," Plant J., 27:257-2653 (2001), which is 
hereby incorporated by reference in its entirety) and probed with a 1 :5000 dilution of 

30 the anti-PCF antibody (Nivison et al., "Sequencing, Processing, and Localization of 
the Petunia CMS-Associated Mitochondrial Protein," Plant J., 5:613-623 (1994), 
which is hereby incorporated by reference in its entirety). 
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[0149] DNA blot hybridization revealed that the fertile phenotype 

cosegregated with the Rf-PPR592 transgene (Figure 5A). The Tl progeny were also 
surveyed for the presence of the CMS-associated 19.5-kDa PCF protein. The 19.5- 
kDa protein was found to be decreased about 10-fold in fertile progeny restored by 
5 Rf-PPR592 relative to sterile progeny and the parental CMS line (Figure 5B). Thus, 
Rf-PPR592 was capable of restoring fertility by decreasing the amount of the PCF 
protein. 

[0150] The cloning of a gene that can restore fertility to male-sterile Petunia 

lines will facilitate elucidation of the mechanism by which expression of the CMS- 

10 associated mitochondrial gene is suppressed. The reduced amount of the PCF protein 
could be due to a reduction in the abundance of one of the Petunia CMS-associated 
transcripts, which was reported previously (Pruitt et al., "Transcription of the Petunia 
Mitochondrial CMS-Associated Locus in Male Sterile and Fertility-Restored 
Lines," Mol. Gen. Genet., 227:348-355 (1991), which is hereby incorporated by 

15 reference in its entirety), or to a translation defect that destabilizes the transcript. In 
yeast, mutation in a transcript-specific translation factor destabilizes the particular 
transcript with which the factor normally interacts (Poutre et al., "PET1 1 1 , a 
Saccharomyces cerevisiae Nuclear Gene Required for Translation of the 
Mitochondrial mRNA Encoding Cytochrome C Oxidase Subunit II," Genetics, 

20 1 1 5:637-647 (1987), which is hereby incorporated by reference in its entirety). 

[0151] A number of fertility restorer genes in other species are known to alter 

transcript profiles and mitochondrial gene product accumulation (Moneger et al., 
"Nuclear Restoration of Cytoplasmic Male Sterility in Sunflower is Associated with 
the Tissue-Specific Regulation of a Novel Mitochondrial Gene," EMBO J.. 13:8-17 

25 (1994); Singh et al., <c Nuclear Genes Associated With a Single Brassica CMS Restorer 
Locus Influence Transcripts of Three Different Mitochondrial Gene Regions," 
Genetics, 143:505-516 (1996); Dewey et al. 5 "Novel Recombinations in the Maize 
Mitochondrial Genome Produce a Unique Transcriptional Unit in the Texas Male- 
Sterile Cytoplasm," CelL 44:439-49 (1986); Wise et al., "Mitochondrial Transcript 

30 Processing and Restoration of Male Fertility in T-Cytoplasm Maize," J Hered, 

90:380-385 (1999), which are hereby incorporated by reference in their entirety). In 
addition to the molecular phenotype of restoration, the Petunia Rf 'locus and Rf loci 
from other species may be similar in genomic organization (Li et al., "Restorer Genes 
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for Different Forms of Brassica Cytoplasmic Male Sterility Map to a Single Nuclear 
Locus That Modifies Transcripts of Several Mitochondrial Genes," Proc. Natl. Acad. 
Sci. USA, 95:10032-10037 (1998); Tang et al., "Cosegregation of Single Genes 
Associated with Fertility Restoration and Transcript Processing of Sorghum 
Mitochondrial orfl07 and urf209? Genetics, 150:383-391 (1998), which are hereby 
incorporated by reference in their entirety). The identification of Petunia if/as a PPR 
family member suggests that searching for PPR motif genes near known restorer loci 
should be a useful strategy to identify candidate restorer genes in other species. 
Further studies of R/-PPR592 and other PPR motif-containing genes in plants, fungi, 
and animals will be required to determine whether the motif has a direct role in RNA- 
protein and/or protein-protein interactions. 

Example 4 - Use of R/-PPR592 or its Homologs/Derivatives to Create Novel 
Floral Structures 

[0152] In Petunia, recombination events near the Rf locus in standard sexual 

crosses resulted in plants with abnormal floral appearance. Moreover, a few of the 
initial transgenic plants transformed by Rf-PPR592 produced flowers with abnormal 
appearance. Furthermore, a number of transgenic plants transformed by Rf-PPR592 
and Rf-PPR591 exhibit abnormalities in floral and vegetative structures. An example 
of abnormal flowers seen in some transgenic plants are shown in Figure 6. 

Example 5 - Identification of a Rice Fertility Restorer Gene 

[0153] The complete rice genome sequence, which has been deposited in 

EMBL/GenBank/DDBJ, was examined for genes similar to the petunia Rf gene, using 
BLASTP. The gene most similar to the petunia Rf 'locus was termed as i?ice Aomolog 
of Petunia restorer 1 {Rhprl). This gene is located very close to the rice R/4 marker 
CI 261. There were a total of 10 PPR genes in the vicinity of this marker on rice 
chromosome 10, which were termed as Rhprl to RhprlO (SEQ ID NOs: 20, 22, 24, 
26, 28, 30, 32, 34, 36, and 38). 

[0154] The most immediate usefulness of identification of the rice restorer 

gene is in marker-assisted selection. This facilitates introduction of the natural wild 
abortive-restorer Rf4 gene, which can restore fertility to the wild abortive cytoplasm 



WO 03/057859 



PCT/US03/00887 



-70- 



by traditional crosses into elite breeding lines for use as a parent in a three-line 
breeding scheme. This use of the information does not involve genetically modified 
organisms and, therefore, can proceed without any of the attendant issues. Random 
screening has identified some molecular markers that are already being used to 
transfer i?/genes in certain nuclear backgrounds (Ichikawa et al., "A Rapid PCR- 
Aided Selection of a Rice Line Containing the Rf-I Gene Which is Involved in 
Restoration of the Cytoplasmic Male Sterility/' Molecular Breeding , 3:195-202 
(1997); Jing et al., "Mapping Fertility-Restoring Genes of Rice WA Cytoplasmic 
Male Sterility Using SSLP Markers," Bot. Bull. Acad. Sin. , 42:167-171 (2001), which 
are hereby incorporated by reference in their entirety). Knowing the actual Rfgene 
sequence makes laborious screening for markers suitable between different breeding 
lines unnecessary. 

[0155] The next possibility is to more rapidly transfer the Rf4 gene into 

existing elite breeding lines by transformation rather than by sexual crosses. In such a 
strategy, the entire natural Rf4 gene would be used to transform a rice line for the 
three-line hybrid rice production method. 

[0156] Because the three-line method for hybrid rice production requires time- 

consuming breeding and labor, presently there are attempts to exploit temperature- 
sensitive male sterility mutants for a two-line method of hybrid seed production. The 
three-line method for hybrid rice production involves construction of three lines. Two 
lines are backcrossed repeatedly so that they contain the same nuclear genome. One 
contains the CMS cytoplasm ("CMS parent") and is male sterile while the other 
("Fertile Maintainer") contains the normal cytoplasm but no restorer of fertility (Rf) 
alleles. By crossing the maintainer as male and the CMS line as female, seeds of the 
CMS line with a known nuclear background can be produced in large quantity. The 
third line is homozygous for one or more fertility restoration loci. Hybrid seed is 
produced by crossing the third line with the CMS parent. The nuclear genomes of the 
third line and the CMS line are selected by breeders to optimize heterosis and 
desirable characteristics for the region in which the hybrid rice will be grown. 
[0157] Rice plants have been found that contain a mutant allele that encodes 

male sterility at high temperatures but fertility at low temperatures (Dong et al., 
"Molecular Mapping of a Rice Gene Conditioning Thermosensitive Genie Male 
Sterility Using AFLP, RFLP and SSR Techniques," Theor Appl Genet., 100:727-734 
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(2000), which is hereby incorporated by reference in its entirety). By growing the rice 
at high temperatures, it can be used as the sterile parent in a cross with an elite 
breeding line. The mutant rice can be propagated by selfing when grown at low 
temperatures. The use of this method in the field on large-scale has not been reported 
5 in the literature, so the feasibility of using a natural temperature-sensitive mutant is 
not known. 

[0158] A cloned R/4 gene could also be used in a two-line method. In this 

scheme, the Rf4 gene regulatory sequences would be engineered so that it could be 
turned on when desired. Then, both a CMS line and a maintainer line, which requires 

10 multiple crosses over a number of years to produce, are not needed. A single line, the 
CMS line containing the engineered Rf4 gene, would serve both as CMS parent and as 
its own maintainer line (Figure 8). The CMS line would be propagated by selfing by 
turning on the Rf4 gene. Without induction of the engineered Rf4 gene, however, the 
line would be sterile and therefore could be used as a CMS parent. 

15 [0159] Although the invention has been described in detail for the purpose of 

illustration, it is understood that such detail is solely for that purpose, and variations 
can be made therein by those skilled in the art without departing from the spirit and 
scope of the invention which is defined by the following claims. 



WO 03/057859 



PCT/US03/00887 



-72- 

WHAT IS CLAIMED: 

1 . An isolated nucleic acid molecule which restores fertility to 
cytoplasmic male sterile plants and modifies expression of toxic mitochondria 
proteins by the plant, wherein the nucleic acid molecule: (1) encodes a protein having 
an amino acid sequence selected from the group consisting of SEQ ID NOs: 2, 21, 23, 
25, 27, 29, 31, 33, 35, 37, 39, and 41; (2) encodes a protein comprising a motif having 
an amino acid sequence corresponding to any of SEQ ID NOs: 3 to 1 8 or an amino 
acid sequence identified by a METAMEME software using the amino acid sequence 
of SEQ ID NO: 2 as input or an amino acid sequence identified as significantly 
similar to SEQ ID NO: 2 using a NCBI BLAST software (threshold- E less than or 
equal to 1 5) with SEQ ID NO: 2 as input; (3) hybridizes to a nucleotide sequence 
selected from the group consisting of from nucleotide 1982 to 3760 of SEQ ID NO: 1, 
SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, 
SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, 
SEQ ID NO: 38, and SEQ ID NO: 40 under stringent conditions of a hybridization 
buffer comprising 20% formamide in 0.9M saline/0.09M SSC buffer at a temperature 
of 42°C; or (4) has a nucleotide sequence selected from the group consisting of from 
nucleotide 1982 to 3760 of SEQ ID NO: 1, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID 
NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID 
NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, and SEQ ID NO: 40. 

2. The isolated nucleic acid molecule according to claim 1 , 
wherein the nucleic acid molecule encodes a protein having an amino acid sequence 
selected from the group consisting of SEQ ID NOs: 2, 21, 23, 25, 27, 29, 31, 33, 35, 
37, 39, and 41. 

3. The isolated nucleic acid molecule according to claim 1, 
wherein the nucleic acid encodes a protein comprising a motif having an amino acid 
sequence corresponding to any of SEQ ID NOs: 3 to 18 or an amino acid sequence 
identified with a METAMEME software using the amino acid sequence of SEQ ID 
NO: 2 as input or an amino acid sequence identified as significantly similar to SEQ 
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ID NO: 2 using a NCBI BLAST software (threshold= E less than or equal to 15) with 
SEQ ID NO: 2 as input. 

4. The isolated nucleic acid molecule according to claim 1, 
wherein the nucleic acid molecule hybridizes to a nucleotide sequence selected from 
the group consisting of from nucleotide 1982 to 3760 of SEQ ID NO: 1, SEQ ID NO: 
19, SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 
28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 
38, and SEQ ID NO: 40 under stringent conditions of a hybridization buffer 
comprising 20% formamide in 0.9M saline/0.09M SSC buffer at a temperature of 
42°C. 

5. The isolated nucleic acid molecule according to claim 1, 
wherein the nucleic acid molecule has a nucleotide sequence selected from the group 
consisting of from nucleotide 1982 to 3760 of SEQ ID NO: 1, SEQ ID NO: 19, SEQ 
ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ 
ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, and 
SEQ ID NO: 40. 

6. The isolated nucleic acid molecule according to claim 1, 
wherein the nucleic acid molecule is from petunia, Arabidopsis thaliana, or rice. 

7. A isolated protein encoded by the nucleic acid molecule 
according to claim 1 . 

8. The isolated protein according to claim 7, wherein the protein 
has an amino acid sequence selected from the group consisting of SEQ ID NOs: 2, 21, 
23, 25, 27, 29, 31, 33, 35, 37, 39, and 41. 

9. The isolated protein according to claim 7, wherein the protein 
encodes a protein comprising a motif having an amino acid sequence corresponding to 
any of SEQ ID NOs: 3 to 18 or an amino acid sequence identified with a 
METAMEME software using the amino acid sequence of SEQ ID NO: 2 as input or 
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an amino acid sequence identified as significantly similar to SEQ ID NO: 2 using a 
NCBI BLAST software (threshold= E less than or equal to 15) with SEQ ID NO: 2 as 
input. 

10. The isolated protein according to claim 7, wherein the nucleic 
acid molecule hybridizes to a nucleotide sequence selected from the group consisting 
of from nucleotide 1982 to 3760 of SEQ ID NO: 1, SEQ ID NO: 19, SEQ ID NO: 20, 
SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, 
SEQ ID NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, and SEQ ID NO: 
40 under stringent conditions of a hybridization buffer comprising 20% formamide in 
0.9M saline/0.09M SSC buffer at a temperature of 42°C. 

11. The isolated protein according to claim 7, wherein the nucleic 
acid molecule has a nucleotide sequence selected from the group consisting of from 
nucleotide 1982 to 3760 of SEQ ID NO: 1, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID 
NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID 
NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, and SEQ ID NO: 40. 

12. The isolated protein according to claim 7, wherein the nucleic 
acid molecule is from petunia, Arabidopsis thaliana, or rice. 

13. An isolated expression system comprising the nucleic acid 
molecule of claim 1 . 

14. The isolated expression system according to claim 13, wherein 
the nucleic acid molecule is in proper sense orientation. 

15. An isolated host cell comprising the nucleic acid molecule of 

claim 1. 

16. The isolated host cell according to claim 15, wherein the 
nucleic acid molecule is in an expression system. 
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1 7. The isolated host cell according to claim 15, wherein the host 
cell is a plant cell. 

18. The isolated host cell according to claim 1 5, wherein the host 
cell is a bacterial cell. 

19. A transgenic plant transformed with the nucleic acid molecule 
according to claim 1 . 

20. The transgenic plant according to claim 19, wherein the nucleic 
acid molecule encodes a protein having an amino acid sequence selected from the 
group consisting of SEQ ID NOs: 2, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, and 41. 

2 1 . The transgenic plant according to claim 1 9, wherein the nucleic 
acid encodes a protein comprising a motif having an amino acid sequence 
corresponding to any of SEQ ID NOs: 3 to 18 or an amino acid sequence identified 
with a METAMEME software using the amino acid sequence of SEQ ID NO: 2 as 
input or an amino acid sequence identified as significantly similar to SEQ ID NO: 2 
using aNCBI BLAST software (threshold= E less than or equal to 15) with SEQ ID 
NO: 2 as input. 

22. The transgenic plant according to claim 19, wherein the nucleic 
acid molecule hybridizes to a nucleotide sequence selected from the group consisting 
of from nucleotide 1982 to 3760 of SEQ ID NO: 1, SEQ ID NO: 19, SEQ ID NO: 20, 
SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, 
SEQ ID NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, and SEQ ID NO: 
40 under stringent conditions of a hybridization buffer comprising 20% formamide in 
0.9M saline/0.09M SSC buffer at a temperature of 42°C. 

23 . The transgenic plant according to claim 1 9, wherein the nucleic 
acid molecule has a nucleotide sequence selected from the group consisting of from 
nucleotide 1982 to 3760 of SEQ ID NO: 1, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID 
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NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID 
NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, and SEQ ID NO: 40. 

24. The transgenic plant according to claim 19, wherein the nucleic 
acid molecule is from petunia, Arabidopsis thaliana, or rice. 

25. The transgenic plant according to claim 19, wherein the 
transgenic plant is a crop plant. 

26. The transgenic plant according to claim 25, wherein the crop 
plant is selected from the group consisting of alfalfa, rice, wheat, barley, rye, cotton, 
sunflower, peanut, corn, potato, sweet potato, bean, pea, chicory, lettuce, endive, 
cabbage, brussel sprout, beet, parsnip, turnip, cauliflower, broccoli, radish, spinach, 
onion, garlic, eggplant, pepper, celery, carrot, squash, pumpkin, zucchini, cucumber, 
apple, pear, melon, citrus, strawberry, grape, raspberry, pineapple, soybean, tobacco, 
tomato, sorghum, and sugarcane. 

27. The transgenic plant according to claim 19, wherein the 
transgenic plant is an ornamental plant. 

28. The transgenic plant according to claim 27, wherein the 
ornamental plant is selected from the group consisting of Arabidopsis thaliana, 
Saintpaulia, petunia, pelargonium, poinsettia, chrysanthemum, carnation, and zinnia. 

29. A transgenic plant seed transformed with the nucleic acid 
molecule according to claim 1 . 

30. The transgenic plant seed according to claim 29, wherein the 
nucleic acid molecule encodes a protein having an amino acid sequence selected from 
the group consisting of SEQ ID NOs: 2, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, and 41. 



3 1 . The transgenic plant seed according to claim 29, wherein the 
nucleic acid encodes a protein comprising a motif having an amino acid sequence 
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corresponding to any of SEQ ID NOs: 3 to 18 or an amino acid sequence identified 
with a METAMEME software using the amino acid sequence of SEQ ID NO: 2 as 
input or an amino acid sequence identified as significantly similar to SEQ ID NO: 2 
using aNCBI BLAST software (threshold= E less than or equal to 15) with SEQ ID 
NO: 2 as input. 

32. The transgenic plant seed according to claim 29, wherein the 
nucleic acid molecule hybridizes to a nucleotide sequence selected from the group 
consisting of from nucleotide 1982 to 3760 of SEQ ID NO: 1, SEQ ID NO: 19, SEQ 
ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ 
ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, and 
SEQ ID NO: 40 under stringent conditions of a hybridization buffer comprising 20% 
formamide in 0.9M saline/0.09M SSC buffer at a temperature of 42°C. 

33. The transgenic plant seed according to claim 29, wherein the 
nucleic acid molecule has a nucleotide sequence selected from the group consisting of 
from nucleotide 1982 to 3760 of SEQ ID NO: 1, SEQ ID NO: 19, SEQ ID NO: 20, 
SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, 
SEQ ID NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, and SEQ ID NO: 
40. 

34. The transgenic plant seed according to claim 29, wherein the 
nucleic acid molecule is from petunia, Arabidopsis thaliana, or rice. 

35. The transgenic plant seed according to claim 29, wherein the 
transgenic plant is a crop plant. 

36. The transgenic plant seed according to claim 35, wherein the 
crop plant is selected from the group consisting of alfalfa, rice, wheat, barley, rye, 
cotton, sunflower, peanut, corn, potato, sweet potato, bean, pea, chicory, lettuce, 
endive, cabbage, brussel sprout, beet, parsnip, turnip, cauliflower, broccoli, radish, 
spinach, onion, garlic, eggplant, pepper, celery, carrot, squash, pumpkin, zucchini, 
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cucurnber, apple, pear, melon, citrus, strawberry, grape, raspberry, pineapple, 
soybean, tobacco, tomato, sorghum, and sugarcane. 

37. The transgenic plant seed according to claim 29, wherein the 
transgenic plant is an ornamental plant. 

38. The transgenic plant seed according to claim 37, wherein the 
ornamental plant is selected from the group consisting of Arabidopsis thaliana, 
Saintpaulia, petunia, pelargonium, poinsettia, chrysanthemum, carnation, and zinnia. 

39. A method of restoring fertility to cytoplasmic male sterile 
plants comprising: 

transforming a cytoplasmic male sterile plant with a nucleic 
acid molecule according to claim 1 under conditions effective to restore fertility to the 
cytoplasmic male sterile plant. 

40. The method according to claim 39, wherein the nucleic acid 
molecule encodes a protein having an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 2, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, and 41. 

41 . The method according to claim 39, wherein the nucleic acid 
encodes a protein comprising a motif having an amino acid sequence corresponding to 
any of SEQ ID NOs: 3 to 18 or an amino acid sequence identified with a 
METAMEME software using the amino acid sequence of SEQ ID NO: 2 as input or 
an amino acid sequence identified as significantly similar to SEQ ID NO: 2 using a 
NCBI BLAST software (threshold= E less than or equal to 15) with SEQ ID NO: 2 as 
input. 

42. The method according to claim 39, wherein the nucleic acid 
molecule hybridizes to a nucleotide sequence selected from the group consisting of 
from nucleotide 1982 to 3760 of SEQ ID NO: 1, SEQ ID NO: 19, SEQ ID NO: 20, 
SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, 
SEQ ID NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, and SEQ ID NO: 
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40 under stringent conditions of a hybridization buffer comprising 20% formamide in 
0.9M saline/0.09M SSC buffer at a temperature of 42°C. 

43. The method according to claim 39, wherein the nucleic acid 
molecule has a nucleotide sequence selected from the group consisting of from 
nucleotide 1982 to 3760 of SEQ ID NO: 1, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID 
NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID 
NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, and SEQ ID NO: 40. 

44. The method according to claim 39, wherein the nucleic acid 
molecule is from petunia, Arabidopsis thaliana, or rice. 

45. The method according to claim 39, wherein the plant is a crop 

plant. 

46. The method according to claim 45, wherein the crop plant is 
selected from the group consisting of alfalfa, rice, wheat, barley, rye, cotton, 
sunflower, peanut, corn, potato, sweet potato, bean, pea, chicory, lettuce, endive, 
cabbage, brussel sprout, beet, parsnip, turnip, cauliflower, broccoli, radish, spinach, 
onion, garlic, eggplant, pepper, celery, carrot, squash, pumpkin, zucchini, cucumber, 
apple, pear, melon, citrus, strawberry, grape, raspberry, pineapple, soybean, tobacco, 
tomato, sorghum, and sugarcane. 

47. The method according to claim 39, wherein the plant is an 
ornamental plant. 

48. The method according to claim 47, wherein the ornamental 
plant is selected from the group consisting of Arabidopsis thaliana, Saintpaulia, 
petunia, pelargonium, poinsettia, chrysanthemum, carnation, and zinnia. 

49. The method according to claim 39, wherein the plant has 2 or 
more copies of the nucleic acid molecule. 
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50. A method of identifying a candidate plant suitable for breeding 
with a cytoplasmic male sterile plant comprising: 

analyzing the candidate plant for the presence, in its genome, of 
a nucleic acid molecule according to claim 1 . 

51 . The method according to claim 50, wherein the nucleic acid 
molecule encodes a protein having an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 2, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, and 41. 

52. The method according to claim 50, wherein the nucleic acid 
encodes a protein comprising a motif having an amino acid sequence corresponding to 
any of SEQ ID NOs: 3 to 18 or an amino acid sequence identified with a 
METAMEME software using the amino acid sequence of SEQ ID NO: 2 as input or 
an amino acid sequence identified as significantly similar to SEQ ID NO: 2 using a 
NCBI BLAST software (threshold= E less than or equal to 15) with SEQ ID NO: 2 as 
input. 

53. The method according to claim 50, wherein the nucleic acid 
molecule hybridizes to a nucleotide sequence selected from the group consisting of 
from nucleotide 1982 to 3760 of SEQ ID NO: 1, SEQ ID NO: 19, SEQ ID NO: 20, 
SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, 
SEQ ID NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, and SEQ ID NO: 
40 under stringent conditions of a hybridization buffer comprising 20% formamide in 
0.9M saline/0.09M SSC buffer at a temperature of 42°C. 

54. The method according to claim 50, wherein the nucleic acid 
molecule has a nucleotide sequence selected from the group consisting of from 
nucleotide 1982 to 3760 of SEQ ID NO: 1, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID 
NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID 
NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, and SEQ ID NO: 40. 

55. The method according to claim 50, wherein the nucleic acid 
molecule is from petunia, Arabidopsis thaliana, or rice. 



WO 03/057859 



PCT/US03/00887 



-81 - 



56. 



The method according to claim 50, wherein the plant is a crop 



plant. 



57. 



The method according to claim 56, wherein the crop plant is 



selected from the group consisting of alfalfa, rice, wheat, barley, rye, cotton, 
sunflower, peanut, corn, potato, sweet potato, bean, pea, chicory, lettuce, endive, 
cabbage, brussel sprout, beet, parsnip, turnip, cauliflower, broccoli, radish, spinach, 
onion, garlic, eggplant, pepper, celery, carrot, squash, pumpkin, zucchini, cucumber, 
apple, pear, melon, citrus, strawberry, grape, raspberry, pineapple, soybean, tobacco, 
tomato, sorghum, and sugarcane. 

58. The method according to claim 50, wherein the plant is an 
ornamental plant. 

59. The method according to claim 58, wherein the ornamental 
plant is selected from the group consisting of Arabidopsis thaliana, Saintpaulia, 
petunia, pelargonium, poinsettia, chrysanthemum, carnation, and zinnia. 

60. A method of identifying a candidate gene restoring fertility in 
plants, said method comprising: 

analyzing the candidate gene for the presence of a nucleic acid 
molecule according to claim 1 . 

61. The method according to claim 60, wherein the nucleic acid 
molecule encodes a protein having an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 2, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, and 41 . 

62. The method according to claim 60, wherein the nucleic acid 
encodes a protein comprising a motif having an amino acid sequence corresponding to 
any of SEQ ID NOs: 3 to 18 or an amino acid sequence identified with a 
METAMEME software using the amino acid sequence of SEQ ID NO: 2 as input or 
an amino acid sequence identified as significantly similar to SEQ ID NO: 2 using a 
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NCBI BLAST software (threshold= E less than or equal to 15) with SEQ ID NO: 2 as 
input. 

63. The method according to claim 60, wherein the nucleic acid 
molecule hybridizes to a nucleotide sequence selected from the group consisting of 
from nucleotide 1982 to 3760 of SEQ ID NO: 1, SEQ ID NO: 19, SEQ ID NO: 20, 
SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, 
SEQ ID NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, and SEQ ID NO: 
40 under stringent conditions of a hybridization buffer comprising 20% formamide in 
0.9M saline/0.09M SSC buffer at a temperature of 42°C. 

64. The method according to claim 60, wherein the nucleic acid 
molecule has a nucleotide sequence selected from the group consisting of from 
nucleotide 1982 to 3760 of SEQ ID NO: 1, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID 
NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID 
NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, and SEQ ID NO: 40. 

65. The method according to claim 60, wherein the nucleic acid 
molecule is from petunia, Arabidopsis thaliana, or rice. 

66. The method according to claim 60, wherein the plant is a crop 

plant. 

67. The method according to claim 66, wherein the crop plant is 
selected from the group consisting of alfalfa, rice, wheat, barley, rye, cotton, 
sunflower, peanut, corn, potato, sweet potato, bean, pea, chicory, lettuce, endive, 
cabbage, brussel sprout, beet, parsnip, turnip, cauliflower, broccoli, radish, spinach, 
onion, garlic, eggplant, pepper, celery, carrot, squash, pumpkin, zucchini, cucumber, 
apple, pear, melon, citrus, strawberry, grape, raspberry, pineapple, soybean, tobacco, 
tomato, sorghum, and sugarcane. 

68. The method according to claim 60, wherein the plant is an 
ornamental plant. 
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69. The method according to claim 68, wherein the ornamental 
plant is selected from the group consisting of Arabidopsis thaliana, Saintpaulia, 
petunia, pelargonium, poinsettia, chrysanthemum, carnation, and zinnia. 

70. A method of producing hybrid plant seed comprising: 
providing a cytoplasmic male sterile plant; 

providing a second plant comprising a nucleic acid molecule 
according to claim 1; and 

breeding the cytoplasmic male sterile plant and the second 
plant under conditions effective to produce hybrid progeny seed which yield fertile 
plants. 

71 . The method according to claim 70, wherein the nucleic acid 
molecule encodes a protein having an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 2, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, and 41. 

72. The method according to claim 70, wherein the nucleic acid 
encodes a protein comprising a motif having an amino acid sequence corresponding to 
any of SEQ ID NOs: 3 to 1 8 or an amino acid sequence identified with a 
METAMEME software using the amino acid sequence of SEQ ID NO: 2 as input or 
an amino acid sequence identified as significantly similar to SEQ ID NO: 2 using a 
NCBI BLAST software (threshold= E less than or equal to 15) with SEQ ID NO: 2 as 
input. 

73. The method according to claim 70, wherein the nucleic acid 
molecule hybridizes to a nucleotide sequence selected from the group consisting of 
from nucleotide 1982 to 3760 of SEQ ID NO: 1, SEQ ID NO: 19, SEQ ID NO: 20, 
SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, 
SEQ ID NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, and SEQ ID NO: 
40 under stringent conditions of a hybridization buffer comprising 20% formamide in 
0.9M saline/0.09M SSC buffer at a temperature of 42°C. 



WO 03/057859 



PCT/US03/00887 



-84- 

74. The method according to claim 70, wherein the nucleic acid 
molecule has a nucleotide sequence selected from the group consisting of from 
nucleotide 1982 to 3760 of SEQ ID NO: 1, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID 
NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID 
NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, and SEQ ID NO: 40. 

75. The method according to claim 70, wherein the nucleic acid 
molecule is from petunia, Arabidopsis thaliana, or rice. 

76. The method according to claim 70, wherein the plants are crop 

plants. 

77. The method according to claim 76, wherein the crop plants are 
selected from the group consisting of alfalfa, rice, wheat, barley, rye, cotton, 
sunflower, peanut, com, potato, sweet potato, bean, pea, chicory, lettuce, endive, 
cabbage, brussel sprout, beet, parsnip, turnip, cauliflower, broccoli, radish, spinach, 
onion, garlic, eggplant, pepper, celery, carrot, squash, pumpkin, zucchini, cucumber, 
apple, pear, melon, citrus, strawberry, grape, raspberry, pineapple, soybean, tobacco, 
tomato, sorghum, and sugarcane. 

78. The method according to claim 70, wherein the plants are 
ornamental plants. 

79. The method according to claim 78, wherein the ornamental 
plants are selected from the group consisting of Arabidopsis thaliana, Saintpaulia, 
petunia, pelargonium, poinsettia, chrysanthemum, carnation, and zinnia. 

80. The hybrid progeny seed produced by the method according to 

claim 70. 



8 1 . The hybrid progeny plants grown from the hybrid progeny seed 
produced by the method according to claim 70. 
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82. A method of producing plant seeds for an inbred line of plants 

comprising: 

providing a cytoplasmic male sterile plant; 

providing a second plant comprising a nucleic acid molecule 

according to claim 1 ; 

breeding the cytoplasmic male sterile plant and the second 
plant under conditions effective to produce hybrid progeny seed which yield fertile 
plants; 

producing hybrid fertile plants from the hybrid progeny seeds; 

and 

backcrossing the hybrid fertile plants and the second plant to 
produce seed which yield inbred progeny plants. 

83. The method according to claim 82, wherein the nucleic acid 
molecule encodes a protein having an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 2, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, and 41. 

84. The method according to claim 82, wherein the nucleic acid 
encodes a protein comprising a motif having an amino acid sequence corresponding to 
any of SEQ ID NOs: 3 to 18 or an amino acid sequence identified with a 
METAMEME software using the amino acid sequence of SEQ ID NO: 2 as input or 
an amino acid sequence identified as significantly similar to SEQ ID NO: 2 using a 
NCBI BLAST software (threshold= E less than or equal to 15) with SEQ ID NO: 2 as 
input. 

85. The method according to claim 82, wherein the nucleic acid 
molecule hybridizes to a nucleotide sequence selected from the group consisting of 
from nucleotide 1982 to 3760 of SEQ ID NO: 1, SEQ ID NO: 19, SEQ ID NO: 20, 
SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, 
SEQ ID NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, and SEQ ID NO: 
40 under stringent conditions of a hybridization buffer comprising 20% formamide in 
0.9M saline/0.09M SSC buffer at a temperature of 42°C. 
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86. The method according to claim 82, wherein the nucleic acid 
molecule has a nucleotide sequence selected from the group consisting of from 
nucleotide 1982 to 3760 of SEQ ID NO: 1, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID 
NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID 
NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, and SEQ ID NO: 40. 

87. The method according to claim 82, wherein the nucleic acid 
molecule is from petunia, Arabidopsis thaliana, or rice. 

88. The method according to claim 82, wherein the plants are crop 

plants. 

89. The method according to claim 88, wherein the crop plants are 
selected from the group consisting of alfalfa, rice, wheat, barley, rye, cotton, 
sunflower, peanut, com, potato, sweet potato, bean, pea, chicory, lettuce, endive, 
cabbage, brussel sprout, beet, parsnip, turnip, cauliflower, broccoli, radish, spinach, 
onion, garlic, eggplant, pepper, celery, carrot, squash, pumpkin, zucchini, cucumber, 
apple, pear, melon, citrus, strawberry, grape, raspberry, pineapple, soybean, tobacco, 
tomato, sorghum, and sugarcane. 

90. The method according to claim 82, wherein the plants are 
ornamental plants. 

9 1 . The method according to claim 90, wherein the ornamental 
plants are selected from the group consisting of Arabidopsis thaliana^ Saintpaulia, 
petunia, pelargonium, poinsettia, chrysanthemum, carnation, and zinnia. 

92. The seed which yield inbred progeny plants produced by the 
method according to claim 82. 

93. The inbred progeny plants grown from the seed which yield 
inbred progeny plants produced by the method of claim 82. 
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94. A method of directing gene expression to plant mitochondria 

comprising: 

transforming a plant with a chimeric nucleic acid molecule 
comprising a transgene operatively linked to a promoter or a terminator from a plant 
gene which restores fertility to cytoplasmic male sterile plants and modifies 
expression of toxic mitochondria proteins by the plant under conditions effective to 
direct expression of the transgene in the mitochondria of the transformed plant, 
wherein the promoter has a nucleotide sequence of from nucleotide 1 to 1981 of SEQ 
ID NO: 1 and the terminator has a nucleotide sequence of from nucleotide 3761 to 
4593 of SEQ ID NO: 1. 

95. The method according to claim 94, wherein the method is 
carried out with a promoter having a nucleotide sequence of from nucleotide 1 to 
1981 of SEQ ID NO: 1. 

96. The method according to claim 94, wherein the method is 
carried out with a terminator having a nucleotide sequence of from nucleotide 3761 to 
4593 of SEQ ID NO: 1. 

97. The method according to claim 94, wherein the plant is a crop 

plant. 

98. The method according to claim 97, wherein the crop plant is 
selected from the group consisting of alfalfa, rice, wheat, barley, rye, cotton, 
sunflower, peanut, corn, potato, sweet potato, bean, pea, chicory, lettuce, endive, 
cabbage, brussel sprout, beet, parsnip, turnip, cauliflower, broccoli, radish, spinach, 
onion, garlic, eggplant, pepper, celery, carrot, squash, pumpkin, zucchini, cucumber, 
apple, pear, melon, citrus, strawberry, grape, raspberry, pineapple, soybean, tobacco, 
tomato, sorghum, and sugarcane. 

99. The method according to claim 94, wherein the plant is an 
ornamental plant. 
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100. The method according to claim 99, wherein the ornamental 
plant is selected from the group consisting of Arabidopsis thaliana, Saintpaulia, 
petunia, pelargonium, poinsettia, chrysanthemum, carnation, and zinnia. 

101. A promoter from a plant gene which restores fertility to 
cytoplasmic male sterile plants and modifies expression of toxic mitochondria 
proteins by the plant under conditions effective to direct expression of the transgene 
in the mitochondria of the transformed plant, wherein the promoter has a nucleotide 
sequence of from nucleotide 1 to 1981 of SEQ ID NO: 1. 

102. A terminator from a plant gene which restores fertility to 
cytoplasmic male sterile plants and modifies expression of toxic mitochondria 
proteins by the plant under conditions effective to direct expression of the transgene 
in the mitochondria of the transformed plant, wherein the terminator has a nucleotide 
sequence of from nucleotide 3761 to 4593 of SEQ ID NO: 1. 

103. A nucleic acid construct comprising: 

a promoter or a terminator from a plant gene which restores 
fertility to cytoplasmic male sterile plants and modifies expression of toxic 
mitochondria proteins by the plant under conditions effective to direct expression of 
the transgene in the mitochondria of the transformed plant and 

a nucleic acid heterologous to and operatively coupled to the 
promoter or the terminator, wherein the promoter has a nucleotide sequence of from 
nucleotide 1 to 1981 of SEQ ID NO: 1 and the terminator has a nucleotide sequence 
of from nucleotide 3761 to 4593 of SEQ ID NO: 1. 

104. A nucleic acid construct according to claim 103, wherein the 
construct is provided with a promoter from a plant gene which restores fertility to 
cytoplasmic male sterile plants and modifies expression of toxic mitochondria 
proteins by the plant under conditions effective to direct expression of the transgene 
in the mitochondria of the transformed plant, said promoter having a nucleotide 
sequence of from nucleotide 1 to 1981 of SEQ ID NO: 1. 
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105. A nucleic acid construct according to claim 103, wherein the 
construct is provided with a terminator from a plant gene which restores fertility to 
cytoplasmic male sterile plants and modifies expression of toxic mitochondria 
proteins by the plant under conditions effective to direct expression of the transgene 
in the mitochondria of the transformed plant, said terminator having a nucleotide 
sequence of from nucleotide 3761 to 4593 of SEQ ID NO: 1. 

1 06. An isolated expression system comprising the nucleic acid 
construct according to claim 103. 

107. An isolated host cell comprising the nucleic acid construct 
according to claim 103. 

108. A transgenic plant comprising the nucleic acid construct 
according to claim 103. 

109. A transgenic plant seed comprising the nucleic acid construct 
according to claim 103. 

110. A method of expressing a gene preferentially in roots of a plant 

comprising: 

transforming a plant with a nucleic acid construct comprising: a 
promoter suitable for driving expression preferentially in roots having a nucleotide 
sequence of from 1 to 1388 of SEQ ID NO: 44; a nucleic acid heterologous to the 
promoter, wherein the promoter is operatively coupled 5' to the nucleic acid to induce 
transcription of the nucleic acid; and a terminator having a nucleotide sequence of 
from nucleotide 3168 to 4016 of SEQ ID NO: 44, wherein the terminator is operably 
coupled 3' to the nucleic acid. 

111. The method according to claim 110, wherein the plant is a crop 

plant. 
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1 12. The method according to claim 111, wherein the crop plant is 
selected from the group consisting of alfalfa, rice, wheat, barley, rye, cotton, 
sunflower, peanut, com, potato, sweet potato, bean, pea, chicory, lettuce, endive, 
cabbage, brussel sprout, beet, parsnip, turnip, cauliflower, broccoli, radish, spinach, 
onion, garlic, eggplant, pepper, celery, carrot, squash, pumpkin, zucchini, cucumber, 
apple, pear, melon, citrus, strawberry, grape, raspberry, pineapple, soybean, tobacco, 
tomato, sorghum, and sugarcane. 

113. The method according to claim 111, wherein the plant is an 
ornamental plant. 

114. The method according to claim 113, wherein the ornamental 
plant is selected from the group consisting of Arabidopsis thaliana, Saintpaulia, 
petunia, pelargonium, poinsettia, chrysanthemum, carnation, and zinnia. 

115. A method of altering plant floral morphology in ornamental 
plants comprising: 

transforming an ornamental plant with a nucleic acid molecule 

according to claim 1. 

116. The method according to claim 1 1 5, wherein the nucleic acid 
molecule encodes a protein having an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 2, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, and 41. 

117. The method according to claim 1 15, wherein the nucleic acid 
encodes a protein comprising a motif having an amino acid sequence corresponding to 
any of SEQ ID NOs: 3 to 18 or an amino acid sequence identified with a 
METAMEME software using the amino acid sequence of SEQ ID NO: 2 as input or 
an amino acid sequence identified as significantly similar to SEQ ID NO: 2 using a 
NCBI BLAST software (threshold= E less than or equal to 15) with SEQ ID NO: 2 as 
input. 
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118. The method according to claim 115, wherein the nucleic acid 
molecule hybridizes to a nucleotide sequence selected from the group consisting of 
from nucleotide 1982 to 3760 of SEQ ID NO: 1, SEQ ID NO: 19, SEQ ID NO: 20, 
SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, 
SEQ ID NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, and SEQ ID NO: 
40 under stringent conditions of a hybridization buffer comprising 20% formamide in 
0.9M saline/0.09M SSC buffer at a temperature of 42°C. 

119. The method according to claim 115, wherein the nucleic acid 
molecule has a nucleotide sequence selected from the group consisting of from 
nucleotide 1982 to 3760 of SEQ ID NO: 1, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID 
NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID 
NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, and SEQ ID NO: 40. 

120. The method according to claim 115, wherein the nucleic acid 
molecule is from petunia, Arabidopsis thaliana, or rice. 

121. The method according to claim 115, wherein the plant is an 
ornamental plant. 

122. The method according to claim 121, wherein the ornamental 
plant is selected from the group consisting of Arabidopsis thaliana, Saintpaulia, 
petunia, pelargonium, poinsettia, chrysanthemum, carnation, and zinnia. 

123. A method of producing plants with a cytoplasmic male sterile 
plant restoration system comprising: 

transforming a first plant in its chloroplast genome with a 
nucleic acid which causes the plant to become male sterile; 

transforming a second plant with a nucleic acid molecule 
according to claim 1 whose protein product is targeted to the chloroplast; and 

crossing the first and second plants to produce progeny plants 
possessing a cytoplasmic male sterile plant restoration system. 
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124. The method according to claim 123, wherein the nucleic acid 
molecule encodes a protein having an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 2, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, and 41. 

125. The method according to claim 123, wherein the nucleic acid 
molecule encodes a protein comprising a motif having an amino acid sequence 
corresponding to any of SEQ ID NOs: 3 to 1 8 or an amino acid sequence identified 
with a METAMEME software using the amino acid sequence of SEQ ID NO: 2 as 
input or an amino acid sequence identified as significantly similar to SEQ ID NO: 2 
using a NCBI BLAST software (threshold= E less than or equal to 15) with SEQ ID 
NO: 2 as input. 

126. The method according to claim 123, wherein the nucleic acid 
molecule hybridizes to a nucleotide sequence selected from the group consisting of 
from nucleotide 1982 to 3760 of SEQ ID NO: 1, SEQ ID NO: 19, SEQ ID NO: 20, 
SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, 
SEQ ID NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, and SEQ ID NO: 
40 under stringent conditions of a hybridization buffer comprising 20% formamide in 
0.9M saline/0.09M SSC buffer at a temperature of 42°C. 

127. The method according to claim 123, wherein the nucleic acid 
molecule has a nucleotide sequence selected from the group consisting of from 
nucleotide 1982 to 3760 of SEQ ID NO: 1, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID 
NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID 
NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, SEQ ID NO: 38, and SEQ ID NO: 40. 

128. The method according to claim 123, wherein the nucleic acid 
molecule is from petunia, Arabidopsis thaliana, or rice. 

129. A method of producing plants with a cytoplasmic male sterile 
plant restoration system comprising: 

mutagenizing a first plant having a nucleic acid which encodes 
a protein comprising a motif having an amino acid sequence corresponding to any of 
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SEQ ID NOs: 3 to 18 or an amino acid sequence identified with a METAMEME 
software using the amino acid sequence of SEQ ID NO: 2 as input or an amino acid 
sequence identified as significantly similar to SEQ ID NO: 2 using a NCBI BLAST 
software (threshold= E less than or equal to 15) with SEQ ID NO: 2 as input; 

crossing the mutagenized first plant with a wild-type plant 
having mitochondrial DNA polymorphisms compared to mitochondrial DNA in the 
mutagenized first plant to produce progeny plants; and 

determining if the progeny plants are fertile, whereby fertile 
progeny plants can be used as a fertile maintainer line, wherein the mutagenized first 
plant, the fertile maintainer line, and a wild-type allele present in the first plant before 
mutagenesis comprises a new cytoplasmic male sterile plant restoration system. 

130. An isolated nucleic acid sequence corresponding to SEQ ID 
NO: 42 or SEQ ID NO: 44. 
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Figure 2 
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Figure 3 
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Figure 4 
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Figure 5 
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Figure 6 
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Diagram of a plant genotype 




Nuclear genotype 

Cytoplasmic genotype 





Wild-type plant with wild-type nuclear alleles 
of gene Ms, a PPR motif gene needed for male 
fertility 

Mutagenesis of the Ms gene (e.g by an insertional element, radiation, 
chemicals, etc. 



Heterozygous mutant at Ms locus 



Self-cross 





Mutant male sterile plant 



By sexual crosses or cybridization with genotypes carrying different 
organelle genomes than the initial wt genome, or by mutagenesis of 
genome(s) in the wt cytoplasm, create a genotype with mutant 
ms/ms nuclear alleles that is male fertile 



Male fertile plant 



These genotypes are then utilized as a CMS/restorer system for hybrid seed production and 
breeding as follows: 






CMS line 



Fertile maintainer line 



Fertility restorer lines 



Figure 7A 
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Examples illustrating how new lines can be used as a CMS/restorer system 



Female parent 



Male parent 
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Progeny 




Maintained CMS line 
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Male parent 
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CMS line 



Fertility restorer lines 



Fertile Hybrid 



Figure 7B 
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Figure 8 
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SEQUENCE LISTING 
<110> Cornell Research Foundation, Inc. 

<120> GENES FOR ALTERING MITOCHONDRIAL FUNCTION AND FOR 
HYBRID SEED PRODUCTION 

<130> 19603/3972 

<140> 
<141> 

<150> 60/374,996 
<151> 2002-01-10 

<160> 52 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 4593 
<212> DNA 
<213> Petunia sp . 

<400> 1 

atatatatat acaaactgat tttttctgtc tatLtgcaca gtgttatatt tacataccct 60 

tgaaaaaggg tagctccgct aataatgtta tctttacaaa aaataacaat aattttttta 120 

cataatatat acaaaactca ttgttatgta ttgtaaatat gataaaaata ttgttatttt 180 

ttgtaatata gctattaggt agtcatgtgg tgtaatattt cctaaaaata tttacctgag 240 

tcggccattt ggctaaaaat attttatttt atagtcgcat atactccaag cttgtatatc 300 

ccatagcgac agtataccta tacgatatct tctattattt 'acctttttag tattcgtata 360 

cccccaatag tatacaagtt tgacaccgca atagtgtacc ccaatgttgt ggttgtgtgg 420 

ctataaaatg tttaaacaaa atttgagtga tggtagtgta attttttagt gtaagctggg 480 

tagttttaaa aacattcttt ttgaaaattg tagttcaagt catagtacaa aaaactgaaa 540 

tatttatatg tttcttgatt ttggctggtc ttctaaaaat tttgaaatgc tggctagttt 600 

tcatttagcg aggggcaaat agactacatg gccaaatttt tacgttaaaa gaagtgttgt 660 

ctgggaaagt attcgaaaag attgtacgga caagtgttgc ctagacaaca cgtcaaatta 720 

tgtagaaaaa tgtaggaaga aattcaaaag caaatattgc ttaagcaaaa ggcagtcaaa 780 

gacaatgctg ccttagggag tgagaaatgg gcatcactat aagattgtat ttccattcga 840 

tatttattca ttataaactt aaggaaaagt gcaaggaaaa gccacttttt ggcttgcctt 900 

taccgttgaa gctactttca aagaaaaaga gctagttttt agcttttttg gaactttaat 960 

cattgtgggc cgaacttcag accttgtggg ccgaacttca tacattcaca agtaaaaaat 1020 

tagctcacag gccactttta ccactagtat ttggtttgaa gtcatttttt tattggtttt 1080 

acatgagaga ccactttttg gaacttcaat ctttgtgcgc ttgaacttca tgcctaagtt 1140 

attaagttca acttcaatcc gtaagggctg aatttttagg catagatgcg taaacttcaa 1200 

ccttgtggac tgaagttgaa cttcgcccct tatggtggcc tgaagttgaa cttcaatcct 1260 

tgtgggctga acttgtgtga agttcaaccc acaaggatta aagtttcaaa aaatgacctc 1320 

tcaagcaaaa tctgcaaaaa aaagtggtct ctcatgcact tttacccatt cgcaaagtag 1380 



1 
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gctgaagttc agcccacaat tattcaagtt ccaaaaaatt tcacaatata tacctcctta 1440 

tctcggttat gatcttttgt atgatttagc aaaatggacg gggaaagtgc acgaaagacc 1500 

acttttgcca ttggtctttg ggtacaggcc actaatacca aaatatttag tttgtggcta 1560 

cttttgctta aagagataga acttcagtcc agaggccgga ttgaagttca gtccttaaag 1620 

attgaacttc gatccagtgc catatggact gaagttcagt caagtcctta agatggaact 1680 

tcagtccaga gccatatgga ctgaagttca atccttaaag atagaacttc agtccagggg 1740 

ccgtatggac tgaagttcag tcaattatca gaacttaagt cagtatttat ttagtaaagg 1800 

cccaaaagtg gttagtataa gaccaataaa aatagaggcc taaaactaaa taacagtgtt 1860 

aaaagtggct gatggacgaa atttctacaa aatggactcg aggtagcaat tcaacttcaa 1920 

cctatggtgt catagtcgta caattcttcc aatcacccct actaagtgaa gtgaagcgaa 1980 

gatgatgaga attgcagtgc gttactgtct caatggtaat ccctttttct cattctttgc 2040 

ttattcaatt gcaccccgac attattctac caatacatgt tccatttcag ttaaagggaa 2100 

ttttggggtt tctaatgaat ttgagaatgt taagtgttta gatgatgctt tcagtttgtt 2160 

ccgtcaaatg gttacaacta agcctcttcc ttctgctgtc tctttctcta aattgttgaa 2220 

agctttggta catatgaagc attactcttc tgttgtttct atttttcgag aaatccacaa 2280 

attacgtatt cctgttgatg ctttcgcctt gagcactgtg gttaacagtt gttgccttat 2340 

gcatcgtacc gatctcggat tttctgtatt agccattcac ttcaagaaag gtattccata 2400 

taatgaagtc acctttacta ccttaataag gggacttttt gctgaaaata aggtcaaaga 24 60 

tgctgttcat ttgttcaaaa agttggtgag ggagaatata tgtgagcctg atgaagtcat 2520 

gtatgggacg gtcatggatg ggctttgcaa gaagggccat actcaaaaag cttttgattt 2580 

gctccggtta atggaacaag gaattactaa gcccgataca tgcatctaca acattgttat 2640 

cgatgccttt tgcaaagatg ggatgctaga tggtgctacc agccttttga acgagatgaa 2700 

acaaaaaaac attcctccag acattattac atatacctca ttgatcgatg gtttgggtaa 2760 

gttaagtcag tgggaaaagg ttaggacttt gttccttgag atgatacatc ttaatattta 2820 

tccagatgtg tgcaccttca actccgtcat tgatggacta tgcaaagagg ggaaagttga 2880 

agatgccgag gaaataatga catacatgat cgaaaaaggt gtagaaccta atgagataac 2940 

ctacaatgtg gtaatggatg gatattgctt gcgtggtcaa atgggtagag cgaggagaat 3000 

ttttgattcc atgatagata agggcattga gcctgatatc attagctata ccgcactaat 3060 

aaatggatac gtcgagaaaa agaaaatgga taaggccatg caattgtttc gtgaaatttc 3120 

tcaaaatgga ttgaaaccta gtattgttac ctgcagtgtt ctcttgcgtg gtctttttga 3180 

agttggaaga actgaatgtg caaaaatatt ctttgatgag atgcaagctg cggggcacat 3240 

acctaattta tacactcatt gcactttgct tggtggttat tttaagaatg gacttgttga 3300 

agaggctatg tcacacttcc ataagttgga aaggaggaga gaagatacaa atattcaaat 3360 

ttacacggct gtcattaatg gattgtgcaa aaatggtaag ctcgacaaag ctcatgctac 3420 

gtttgagaag cttcccttga taggcttaca tcctgatgtg ataacataca ctgcaatgat 3480 

tagtggatat tgtcaagaag ggttgttaga tgaagctaaa gatatgctaa ggaaaatgga 3540 
ggacaatggt tgtttgccag acaaccgaac atacaatgtt attgtgcggg gatttttcag 3600 

aagcagtaaa gttagtgaaa tgaaggcttt tctgaaggaa atagctggga agagcttctc 3660 

atttgaggca gctactgtag agttattgat ggatattata gcagaggatc cttctttgct 3720 
taacatgatt ccagaatttc accgggataa taagaagtga ataacttttg cacctgtttt 3780 
ttttgacgat atcaccatta ttctgctatt tcctttcatc ttagcaaaag aaattgcatc 3840 

cagtggaatt gcggaagctg aaaaaatggc aagaagaaca ttgcttaagc tttcctggca 3900 

agcttatatc ggagggacat cattttggtt gttttggctc tcttctttat cttggaaatc 3960 

aaatgttctg cgctcttaat atcagaaaca atgtgaactc ccatatatgt acgagttata 4020 

agtttcggaa tatgatttca atggtttcag tattctattt ttgatatgga attaattttt 4080 

gagcgaccca gtgttgacca ttgcctacct tcggttatta tatgattgaa attccctcca 4140 

atctccaata ctcacttcat tttgtcttgt tgaatttttc aatttttctt tttctgttac 4200 

gattgtcatt ttcaccgcct tgagtatcca tcaggttcca gttgaaaaag aatcattttt 4260 
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tgccatgacc atcatgcttt ctgagtgcaa gatcaagaga ggtacttttc tctctaagaa 4320 
cctcttggtt ttttaagtgt tctgggttct ttcagtactt ttaagctatt ttctaatcct 4380 
ttgaagagat tcatacatat ctgtgcatgt gtttgtttct ttttttcggg tgatactttg 4440 
ttttatagct aaggattgaa aaggtaattt tcattttcat tagcaataga tatgaaacag 4500 
ctttgtaagg actctggagt ctcctaaaaa ttttggctat gcaaatagcc tattgcatca 4560 
atttgtcgtt gaaatccatg tatcataaaa aaa 4593 



<210> 2 

<211> 592 

<212> PRT 

<213> Petunia sp. 

<400> 2 

Met Met Arg lie Ala Val Arg Tyr Cys Leu Asn Gly Asn Pro Phe Phe 
15 10 15 

Ser Phe Phe Ala Tyr Ser lie Ala Pro Arg His Tyr Ser Thr Asn Thr 
20 25 30 

Cys Ser lie Ser Val Lys Gly Asn Phe Gly Val Ser Asn Glu Phe Glu 
35 40 45 

Asn Val Lys Cys Leu Asp Asp Ala Phe Ser Leu Phe Arg Gin Met Val 
50 55 60 

Thr Thr Lys Pro Leu Pro Ser Ala Val Ser Phe Ser Lys Leu Leu Lys 
65 70 75 80 

Ala Leu Val His Met Lys His Tyr Ser Ser Val Val Ser lie Phe Arg 
85 90 95 

Glu lie His Lys Leu Arg lie Pro Val Asp Ala Phe Ala Leu Ser Thr 
100 105 110 

Val Val Asn Ser Cys Cys Leu Met His Arg Thr Asp Leu Gly Phe Ser 
115 120 125 

Val Leu Ala lie His Phe Lys Lys Gly lie Pro Tyr Asn Glu Val Thr 
130 135 140 

Phe Thr Thr Leu lie Arg Gly Leu Phe Ala Glu Asn Lys Val Lys Asp 
145 150 155 160 

Ala Val His Leu Phe Lys Lys Leu Val Arg Glu Asn lie Cys Glu Pro 
165 170 175 

Asp Glu Val Met Tyr Gly Thr Val Met Asp Gly Leu Cys Lys Lys Gly 
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180 185 190 

His Thr Gin Lys Ala Phe Asp Leu Leu Arg Leu Met Glu Gin Gly lie 
195 200 205 

Thr Lys Pro Asp Thr Cys lie Tyr Asn lie Val lie Asp Ala Phe Cys 
210 215 220 

Lys Asp Gly Met Leu Asp Gly Ala Thr Ser Leu Leu Asn Glu Met Lys 
225 230 235 240 

Gin Lys Asn lie Pro Pro Asp lie lie Thr Tyr Thr Ser Leu lie Asp 
245 250 255 

Gly Leu Gly Lys Leu Ser Gin Trp Glu Lys Val Arg Thr Leu Phe Leu 
260 265 270 

Glu Met lie His Leu Asn lie Tyr Pro Asp Val Cys Thr Phe Asn Ser 
275 280 285 

Val lie Asp Gly Leu Cys Lys Glu Gly Lys Val Glu Asp Ala Glu Glu 
290 295 300 

lie Met Thr Tyr Met He Glu Lys Gly Val Glu Pro Asn Glu He Thr 
305 310 315 320 

Tyr Asn Val Val Met Asp Gly Tyr Cys Leu Arg Gly Gin Met Gly Arg 
325 330 335 

Ala Arg Arg He Phe Asp Ser Met He Asp Lys Gly He Glu Pro Asp 
340 345 350 

He He Ser Tyr Thr Ala Leu He Asn Gly Tyr Val Glu Lys Lys Lys 
355 360 365 

Met Asp Lys Ala Met Gin Leu Phe Arg Glu He Ser Gin Asn Gly Leu 
370 375 380 

Lys Pro Ser He Val Thr Cys Ser Val Leu Leu Arg Gly Leu Phe Glu 
385 390 395 400 

Val Gly Arg Thr Glu Cys Ala Lys He Phe Phe Asp Glu Met Gin Ala 
405 410 415 

Ala Gly His He Pro Asn Leu Tyr Thr His Cys Thr Leu Leu Gly Gly 
420 425 430 

Tyr Phe Lys Asn Gly Leu Val Glu Glu Ala Met Ser His Phe His Lys 
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435 

Leu Glu Arg Arg 
450 

lie Asn Gly Leu 
465 

Phe Glu Lys Leu 



Thr Ala Met lie 
500 

Lys Asp Met Leu 
515 

Arg Thr Tyr Asn 
530 

Ser Glu Met Lys 
545 

Phe Glu Ala Ala 



Pro Ser Leu Leu 
580 



440 

Arg Glu Asp Thr 
455 

Cys Lys Asn Gly 
470 

Pro Leu lie Gly 
485 

Ser Gly Tyr Cys 



Arg Lys Met Glu 
520 

Val lie Val Arg 

535 

Ala Phe Leu Lys 
550 

Thr Val Glu Leu 
565 

Asn Met lie Pro 



Asn lie Gin lie 
460 

Lys Leu Asp Lys 
475 

Leu His Pro Asp 
490 

Gin Glu Gly Leu 
505 

Asp Asn Gly Cys 



Gly Phe Phe Arg 
540 

Glu lie Ala Gly 
555 

Leu Met Asp lie 
570 

Glu Phe His Arg 
585 



445 

Tyr Thr Ala Val 



Ala His Ala Thr 
480 

Val lie Thr Tyr 
495 

Leu Asp Glu Ala 
510 

Leu Pro Asp Asn 
525 

Ser Ser Lys Val 



Lys Ser Phe Ser 
560 

lie Ala Glu Asp 
575 

Asp Asn Lys Lys 
590 



<210> 3 
<211> 21 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Consensus 

motif derived from 1,303 PPRs reported by Small & 
Peeters 

<400> 3 

Glu Glu Ala Leu Tyr Met Gly Pro Asn Thr Tyr Asn Ala Leu lie Asn 
15 10 15 

Ala Tyr Ala Lys Gly 
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20 



<210> 4 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Consensus 

motif derived from the 14 PPRs found in Rf-PPR592 

<400> 4 

Asp Ala Phe Met Gly Pro Asp Thr Tyr Leu lie Gly Leu Cys Lys Gly 
15 10 15 



<210> 5 

<211> 35 

<212> PRT 

<213> Petunia sp. 

<400> 5 

Asp Asp Ala Phe Ser Leu Phe Arg Gin Met Val Thr Thr Lys Pro Leu 
15 10 15 

Pro Ser Ala Val Ser Phe Ser Lys Leu Leu Lys Ala Leu Val His Met 
20 25 30 

Lys His Tyr 
35 



<210> 6 

<211> 35 

<212> PRT 

<213> Petunia sp . 

<400> 6 

Ser Ser Val Val Ser lie Phe Arg Glu lie His Lys Leu Arg lie Pro 
15 10 15 

Val Asp Ala Phe Ala Leu Ser Thr Val Val Asn Ser Cys Cys Leu Met 
20 25 30 

His Arg Thr 
35 



6 



WO 03/057859 



PCT/US03/00887 



<210> 7 

<211> 35 

<212> PRT 

<213> Petunia sp. 

<400> 7 

Asp Leu Gly Phe Ser Val Leu Ala lie His Phe Lys Lys Gly lie Pro 
15 10 15 

Tyr Asn Glu Val Thr Phe Thr Thr Leu He Arg Gly Leu Phe Ala Glu 
20 25 30 

Asn Lys Val 
35 



<210> 8 

<211> 35 

<212> PRT 

<213> Petunia sp. 

<400> 8 

Asp Ala Val His Leu Phe Lys Lys Leu Val Arg Glu Asn He Cys Glu 
15 10 15 

Pro Asp Glu Val Met Tyr Gly Thr Val Met Asp Gly Leu Cys Lys Lys 
20 25 30 

Gly His Thr 
35 



<210> 9 

<211> 35 

<212> PRT 

<213> Petunia sp. 

<400> 9 

Gin Lys Ala Phe Asp Leu Leu Arg Leu Met Glu Gin Gly He Thr Lys 
15 10 15 

Pro Asp Thr Cys He Tyr Asn He Val He Asp Ala Phe Cys Lys Asp 
20 25 30 

Gly Met Leu 
35 



7 



WO 03/057859 



PCT/US03/00887 



<210> 10 

<211> 35 

<212> PRT 

<213> Petunia sp. 

<400> 10 

Asp Gly Ala Thr Ser Leu Leu Asn Glu Met Lys Gin Lys Asn lie Pro 
15 10 15 

Pro Asp lie lie Thr Tyr Thr Ser Leu lie Asp Gly Leu Gly Lys Leu 
20 25 30 

Ser Gin Trp 

35 



<210> 11 

<211> 35 

<212> PRT 

<213> Petunia sp. 

<400> 11 

Glu Lys Val Arg Thr Leu Phe Leu Glu Met lie His Leu Asn lie Tyr 
15 10 15 

Pro Asp Val Cys Thr Phe Asn Ser Val lie Asp Gly Leu Cys Lys Glu 
20 25 30 

Gly Lys Val 

35 



<210> 12 

<211> 35 

<212> PRT 

<213> Petunia sp. 

<400> 12 

Glu Asp Ala Glu Glu lie Met Thr Tyr Met lie Glu Lys Gly Val Glu 
15 10 15 

Pro Asn Glu lie Thr Tyr Asn Val Val Met Asp Gly Tyr Cys Leu Arg 
20 25 30 

Gly Gin Met 
35 
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<210> 13 

<211> 35 

<212> PRT 

<213> Petunia sp. 

<400> 13 

Gly Arg Ala Arg Arg lie Phe Asp Ser Met lie Asp Lys Gly lie Glu 
15 10 15 

Pro Asp lie lie Ser Tyr Thr Ala Leu lie Asn Gly Tyr Val Glu Lys 
20 25 30 

Lys Lys Met 

35 



<210> 14 
■<211> 35 
<212> PRT 
<213> Petunia sp. 

<400> 14 

Asp Lys Ala Met Gin Leu Phe Arg Glu lie Ser Gin Asn Gly Leu Lys 
15 10 15 

Pro Ser lie Val Thr Cys Ser Val Leu Leu Arg Gly Leu Phe Glu Val 
20 25 30 

Gly Arg Thr 
35 



<210> 15 

<211> 35 

<212> PRT 

<213> Petunia sp. 

<400> 15 

Glu Cys Ala Lys lie Phe Phe Asp Glu Met Gin Ala Ala Gly His lie 
15 10 15 

Pro Asn Leu Tyr Thr His Cys Thr Leu Leu Gly Gly Tyr Phe Lys Asn 
20 25 30 

Gly Leu Val 
35 
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<210> 16 

<211> 35 

<212> PRT 

<213> Petunia sp. 

<400> 16 

Glu Glu Ala Met Ser His Phe His Lys Leu Glu Arg Arg Arg Glu Asp 
15 10 15 

Thr Asn lie Gin lie Tyr Thr Ala Val lie Asn Gly Leu Cys Lys Asn 
20 25 30 

Gly Lys Leu 
35 



<210> 17 

<211> 35 

<212> PRT 

<213> Petunia sp. 

<400> 17 

Asp Lys Ala His Ala Thr Phe Glu Lys Leu Pro Leu lie Gly Leu His 
15 10 15 

Pro Asp Val lie Thr Tyr Thr Ala Met lie Ser Gly Tyr Cys Gin Glu 
20 25 30 

Gly Leu Leu 
35 



<210> 18 

<211> 35 

<212> PRT 

<213> Petunia sp. 

<400> 18 

Asp Glu Ala Lys Asp Met Leu Arg Lys Met Glu Asp Asn Gly Cys Leu 
15 10 15 

Pro Asp Asn Arg Thr Tyr Asn Val lie Val Arg Gly Phe Phe Arg Ser 
20 25 30 

Ser Lys Val 
35 
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<210> 19 
<211> 8440 
<212> DNA 
<213> Petunia sp. 



<400> 19 

ggatccaaaa tttcactaaa ggttaaacgc gaggatactg aagttggaga gcaatgtggt 60 

atcttggtgc atggacggag tcatgggggt atagttgctg gctatttagc taactggaga 120 

actgttgtga ggaattattt aaagaatgct actttctcgt ccacataaac atgtccaaat 180 

attttctact tgatagagag ttcaaggaaa tagtgtggat ttcttcccaa acacaaacga 240 

tttgagaaaa ctgaagtgaa ggctgaagag aaaactaaag agaactggaa gctaagaaac 300 

acagaagcac aaacctataa acatagacac tggcatgttg cagaaaattt taactttcga 360 

ttctccagta gaaagacaca aatacatcag taaattttct ttaggctcaa gcaaggatac 420 

atcttggtag aatttgcatt ataccaacat aataagctca aaaaaataac taagctgcaa 480 

ctagctactt ggtccgagaa gcttttgcta tcaggaagtt ccacagttcc aaaaccaagg 540 

tagactatca agctagttcc caccagtcat tttcttagac tttgctctca cgataaacta 600 

agatcatttt tttatgatac atggatttca acgacaaatt gatgcaatag gctatttgca 660 

tagccaaaat ttttaggaga ctccagagtc cttacaaagc tgtttcatat ctattgctaa 720 

tgaaaatgaa aattaccttt tcaatcctta gctataaaac aaagtatcac ccgaaaaaaa 780 

gaaacaaaca catgcacaga tatgtatgaa tctcttcaaa ggattagaaa atagcttaaa 840 

agtactgaaa gaacccagaa cacttaaaaa accaagaggt tcttagagag aaaagtacct 900 

ctcttgatct tgcactcaga aagcatgatg gtcatggcaa aaaatgattc tttttcaact 960 

ggaacctgat ggatactcaa ggcggtgaaa atgacaatcg taacagaaaa agaaaaattg 1020 

aaaaattcaa caagacaaaa tgaagtgagt attggagatt ggagggaatt tcaatcatat 1080 

aataaccgaa ggtaggcaat ggtcaacact gggtcgctca aaaattaatt ccatatcaaa 1140 

aatagaatac tgaaaccatt gaaatcatat tccgaaactt ataactcgta catatatggg 1200 

agttcacatt gtttctgata ttaagagcgc agaacatttg atttccaaga taaagaagag 1260 

agccaaaaca accaaaatga tgtccctccg atataagctt gccaggaaag cttaagcaat 1320 

gttcttcttg ccattttttc agcttccgca attccactgg atgcaatttc ttttgctaag 1380 

atgaaaggaa atagcagaat aatggtgata tcgtcaaaaa aaacaggtgc aaaagttatt 1440 

cacttcttat tatcccggtg aaattctgga atcatgttaa gcaaagaagg atcctctgct 1500 

ataatatcca tcaataactc tacagtagct gcctcaaatg agaagctctt cccagctatt 1560 

tccttcagaa aagccttcat ttcactaact ttactgcttc tgaaaaatcc ccgcacaata 1620 

acattgtatg ttcggttgtc tggcaaacaa ccattgtcct ccattttcct tagcatatct 1680 

ttagcttcat ctaacaaccc ttcttgacaa tatccactaa tcattgcagt gtatgttatc 1740 

acatcaggat gtaagcctat caagggaagc ttctcaaacg tagcatgagc tttgtcgagc 1800 

ttaccatttt tgcacaatcc attaatgaca gccgtgtaaa tttgaatatt tgtatcttct 1860 

ctcctccttt ccaacttatg gaagtgtgac atagcctctt caacaagtcc attcttaaaa 1920 

taaccaccaa gcaaagtgca atgagtgtat aaattaggta tgtgccccgc agcttgcatc 1980 

tcatcaaaga atatttttgc acattcagtt cttccaactt caaaaagacc acgcaagaga 2040 

acactgcagg taacaatact aggtttcaat ccattttgag aaatttcacg aaacaattgc 2100 

atggccttat ccattttctt tttctcgacg tatccattta ttagtgcggt atagctaatg 2160 

atatcaggct caatgccctt atctatcatg gaatcaaaaa ttctcctcgc tctacccatt 2220 

tgaccacgca agcaatatcc atccattacc acattgtagg ttatctcatt aggttctaca 2280 

cctttttcga tcatgtatgt cattatttcc tcggcatctt caactttccc ctctttgcat 2340 

agtccatcaa tgacggagtt gaaggtgcac acatctggat aaatattaag atgtatcatc 2400 

tcaaggaaca aagtcctaac cttttcccac tgacttaact tacccaaacc atcgatcaat 2460 
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gaggtatatg taataatgtc tggaggaatg tttttttgtt tcatctcgtt caaaaggctg 2520 
gtagcaccat ctagcatccc atctttgcaa aaggcatcga taacaatgtt gtagatgcat 2580 
gtatcgggct tagtaattcc ttgttccatt aaccggagca aatcaaaagc tttttgagta 2640 
tggcccttct tgcaaagccc atccatgacc gtcccataca tgacttcatc aggctcacat 2700 
atattctccc tcaccaactt tttgaacaaa tgaacagcat ctttgacctt attttcagca 2760 
aaaagtcccc ttattaaggt agtaaaggtg acttcattat atggaatacc tttcttgaag 2820 
tgaatggcta atacagaaaa tccgagatcg gtacgatgca taaggcaaca actgttaacc 2880 
acagtgctca aggcgaaagc atcaacagga atacgtaatt tgtggatttc tcgaaaaata 2940 
gaaacaacag aagagtaatg cttcatatgt accaaagctt tcaacaattt agagaaagag 3000 
acagcagaag gaagaggctt agttgtaacc atttgacgga acaaactgaa agcatcatct 3060 
aaacacttaa cattctcaaa ttcattagaa accccaaaat tccctttaac tgaaatggaa 3120 
catgtattgg tagaataatg tcggggtgca attgaataag caaagaatga gaaaaaggga 3180 
ttaccattga gacagtaacg cactgcaatt ctcatcatct tcgcttcact tcacttagta 3240 
ggggtgattg gaagaattgt acgactatga caccataggt tgaagttgaa ttgctacctc 3300 
gagtccattt tgtagaaatt tcgtccatca gccactttta acactgttat ttagttttag 3360 
gcctctattt ttattggtct tatactaacc acttttgggc ctttactaaa taaatactga 3420 
cttaagttct gataattgac tgaacttcag tccatacggc ccctggactg aagttctatc 3480 
tttaaggatt gaacttcagt ccatatggct ctggactgaa gttccatctt aaggacttga 3540 
ctgaacttca gtccatatgg cactggatcg aagttcaatc tttaaggact gaacttcaat 3600 
ccggcctctg gactgaagtt ctatctcttt aagcaaaagt agccacaaac taaatatttt 3660 
ggtattagtg gcctgtaccc aaagaccaat ggcaaaagtg gtctttcgtg cactttcccc 3720 
gtccattttg ctaaatcata caaaagatca taaccgagat aaggaggtat atattgtgaa 3780 
attttttgga acttgaataa ttgtgggctg aacttcagcc tactttgcga atgggtaaaa 3840 
gtgcatgaga gaccactttt ttttgcagat tttgcttgag aggtcatttt ttgaaacttt 3900 
aatccttgtg ggttgaactt cacacaagtt cagcccacaa ggattgaagt tcaacttcag 3960 
gccaccataa ggggcgaagt tcaacttcag tccacaaggt tgaagtttac gcatctatgc 4020 
ctaaaaattc agcccttacg gattgaagtt gaacttaata acttaggcat gaagttcaag 4080 
cgcacaaaga ttgaagttcc aaaaagtggt ctctcatgta aaaccaataa aaaaatgact 4140 
tcaaaccaaa tactagtggt aaaagtggcc tgtgagctaa ttttttactt gtgaatgtat 4200 
gaagttcggc ccacaaggtc tgaagttcgg cccacaatga ttaaagttcc aaaaaagcta 4260 
aaaactagct ctttttcttt gaaagtagct tcaacggtaa aggcaagcca aaaagtggct 4320 
tttccttgca cttttcctta agtttataat gaataaatat cgaatggaaa tacaatctta 4380 
tagtgatgcc catttctcac tccctaaggc agcattgtct ttgactgcct tttgcttaag 44 40 
caatatttgc ttttgaattt cttcctacat ttttctacat aatttgacgt gttgtctagg 4500 
caacacttgt ccgtacaatc ttttcgaata ctttcccaga caacacttct tttaacgtaa 4560 
aaatttggcc atgtagtcta tttgcccctc gctaaatgaa aactagccag catttcaaaa 4620 
tttttagaag accagccaaa atcaagaaac atataaatat ttcagttttt tgtactatga 4680 
cttgaactac aattttcaaa aagaatgttt ttaaaactac ccagcttaca ctaaaaaatt 4740 
acactaccat cactcaaatt ttgtttaaac attttatagc cacacaacca caacattggg 4800 
gtacactatt gcggtgtcaa acttgtatac tattgggggt atacgaatac taaaaaggta 4860 
aataatagaa gatatcgtat aggtatactg tcgctatggg atatacaagc ttggagtata 4920 
tgcgactata aaataaaata tttttagcca aatggccgac tcaggtaaat atttttagga 4980 
aatattacac cacatgacta cctaatagct atattacaaa aaataacaat atttttatca 5040 
tatttacaat acataacaat gagttttgta tatattatgt aaaaaaatta ttgttatttt 5100 
ttgtaaagat aacattatta gcggagctac cctttttcaa gggtatgtaa atataacact 5160 
gtgcaaatag acagaaaaaa tcagtttgta tatatatatc agatattgat tcccccttca 5220 
ttttttcgta tgtttacttt tttatattta tatatccctt agtaaaaata ctggctccgc 5280 
cactgccagt aaggtagtat tagtttgcgt cgctcaataa agtaacatct atcgtttatt 5340 
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tttcatcaac attaaaaagg aagattcact atccacatag gcatcatcat tatcaaagaa 5400 
tatcagttca tacattgtat atatataact ttctcaaata aactaacttt aaaatgaagt 54 60 
acattaaaaa ggaagattca ctatcctttt aatatttcgt atatttactt atttatattt 5520 
tgatactcct tagtaaaaat actggctccg ccactaccag tgatgtaata ttaattcgcg 5580 
tccctcacta aagtaacacc tataatttaa ttttcatgaa gtcagagtta gcattggaaa 5640 
gggatataag cacatgcatt gtgtatatat atataacttg ctcaaataaa ctaacttaaa 5700 
aatgaaattt tacttttcct agtacaatga actatgcatc aatgcgtaat tagttgaggt 5760 
cggctatatg aatatgttat taatttgaaa gcaaaacata ataactgata gaagaatttt 5820 
gcacctaaaa attgaacttg agctgcttca gttactatct catttttcac tatatatgtg 5880 
tgtatcagct aattctatga tttaattaaa caaattgtaa gtattaacaa aataacgaat 5940 
aaatatggaa aataagtact tgatgaacgt agggccggag ttggccgagg tgaccggaga 6000 
caatggaaag cagagttact atttttgact aaatagccac aaaagaatca ttgtttttac 6060 
aatgtagcaa gttggcacga ttatgattct tgacacaata gccacattat agaaagataa 6120 
tgtggcacta atgaggtaat tttcattatg gaatgataac aaacaaataa gtacacgatt 6180 
aaaacaaact gaaagggttt tgcgcgataa tttaatcatt gttttacgaa aatactcatc 6240 
aaaatcaaaa tatttatctg cctctgcatg taagtttcat atttactcgt cttgccataa 6300 
tttatatgaa aaaatttact cacaccggat atatatacct aaccatagca attaactatg 6360 
gtaatgtgat gtaatgaaga gagaatgtct attaattaat tatggctaag tgatagtagt 6420 
gtattgtaaa caaatgacgt gcatttgttg attagacact tacaaaaata cccacgaaat 6480 
ctaaaataat tacagccact atccactact ttcaaatatt atctggccta cccattaaaa 6540 
tatttactca ctctacccct ccagacttat atattataag gtataaaaag gtaaacaata 6600 
ataaatggtc ctccagactt ttataccata atttatgcag ccttaaaggt atacacctat 6660 
aaacaaaggt atacaataaa aaatgggtat gttgggtaaa tacttttagt tttatgggta 6720 
gagtaatttt taatgggtat gacttgtaaa tactttaaat ttcatgggta tagagtgtaa 6780 
aaattccttt gttgattggg tatatacacc cgatgtgggt aagtacttgc ctaatttttg 6840 
ccctaaggta aatatagact tatagtataa aaaaaaatac gcagtgtgat agatactttg 6900 
aatttcatga gtaatgtaat ttttaatagg tatagtaagg taaatacttt atattccaag 6960 
ggtacacatt gtaaaaagct caatatttta ttccaaagaa taagagacca aacaatgtgt 7020 
ttgagattta ttactttgtt gtccaccaaa ctaaaaagaa aactttagaa gtctaaatta 7080 
caataatctt aacatgcatt ttacgaataa atatcacaaa atctcaaact attagagata 7140 
atgtcgtgga tgatgttaac atattggact acacaaccca ttgtacaata attttgaagc 7200 
atgtatatgc acgaccaaga ctccatcatc atagatcaaa tgaatgttca ttttaatgca 7260 
tgaaacctaa gtagaacatt tatgccttaa tgaactaaaa ccaagcaaaa agatacatct 7320 
acttgtgcaa ttgaatgaat tctaccgtat atactaatat acaccagagg ttagtttaac 7380 
acttggaact tcaaaaggtg tacaaccata gagtttcctt tacattgatg gtttctttca 7440 
tttcactaac tgataaaatg aaggctggta tagtctacca aatccctagt tccctgtgaa 7500 
cttgcatccc ttctagctac atgcagaaca tgtcctttag atcccatagg tgtattgcca 7560 
tttgccactg aacaatggag gacaatgtat aattgtcctc ctcacccatt gcacatactc 7620 
tgtcatttgc tgcacatcta catgcctttt ctgaatattc ttctgagtca aataagcatc 7680 
atgagacatc tgtcaagtat ctttgaatgg gataatcaca tttccaaatc gaaaggttct 7740 
tgtcttaaca agtcaagctg catctcgaca aagagacttc gttgatgaaa tgcgaccata 7800 
aagagcacat gcaaaccagt tgttaaaagc attgtacaca tatacctact tctgatgtga 7860 
ataaaagaaa gtcgatcaat gacagaggaa aacagtcaat ctataaccac aaaaatactt 7920 
ttctttaaaa gtacgaccac atagataact ataatttccc gtagatgtca aactcttatt 7980 
gaataaaaaa taaacaaaca tgtactattt gctcatttat ccgactgtca caaggatttt 8040 
cttaatgatg gtataatagg agcaatccct tttatgacag atgcactaat ttgtttgggt 8100 
gcatatttca atgcagaact gtggggtata taatctaaaa tatcattcaa atcaaacctg 8160 
ggaacgattg agagaagatt agcatggcct ctgcacaagg atgacacgca taaatcgaga 8220 
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aatgttccaa ataaaggaaa tatatatatt acctgtttca attggcatag ttcttaaaga 8280 

agttttggca gttaaagtat taatagttta ccttgtttcg attgtgggat ttagccttgg 8340 

ggttgtctgg gacggacctg tgattattct gctaatctcc ttgtatattc atgcaatgtg 8400 

cagtttaatc cagtgcattt tgcgtgttat ggatggatcc 8440 



<210> 20 

<211> 2637 

<212> DNA 

<213> Oryza sativa 

<400> 20 

atggcgcgcc gcgtccctac ccgcccgcgc ggcggtggcg gcggcggcgt cccacgctcg 60 

gagggctcga tccaagggcg aggaggccgc gcggggggca gtggcgccga ggacgcacgc 120 

cacgtgttcg acgaattgct ccggcgtggc aggggcgcct cgatctacgg cttgaaccgc 180 

gccctcgccg acgtcgcgcg tcacagcccc gcggccgccg tgtcccgcta caaccgcatg 240 

gcccgagccg gcgccggcaa ggtaactccc accgtgcaca cctatgccat cctcatcggc 300 

tgctgctgcc gtgcgggccg cttggacctc ggtttcgcgg ccttgggcaa tgtcgtcaag 360 

aagggattta gagtggatgc catcaccttc actcctctgc tcaagggcct ctgtgccgac 420 

aagaggacga gcgacgcaat ggacatagtg ctccgcagaa tgaccgagct cggctgcata 480 

ccagatgtct tctcctacaa taatcttctc aagggtctgt gtgatgagaa cagaagccaa 540 

gaagctctcg agctgctgca catgatggct gatgatcgag gaggaggtag cccacctgat 600 

gtggtgtcgt ataacactgt cctcaatggc ttcttcaaag agggggattc agacaaagct 660- 

tacagtacat accatgaaat gctggaccgg gggattttac cagatgttgt gacctacagc 720 

tctattattg ctgcgttatg caaggctcaa gctatggaca aagccatgga ggtacttaac 780 

accatggtta agaatggtgt catgcctgat tgcatgacat ataatagtat tctgcatgga 840 

tattgctctt cagggcagcc aaaagaggct attggaacac tcaaaaagat gcgcagtgat 900 

ggcgtcgaac caaatgttgt tacttatagt tcactgatga attatctttg caagaatgga 960 

agatccaccg aagctagaaa gattttcgat tctatgacca agaggggcct agagcctgat 1020 

attgctacct atcgtaccct gcttcagggg tatgctacca aaggagccct tgttgagatg 1080 

catgctctct tggatttgat ggtacgaaat ggtatccaac cggatcatca tgtattcaac 1140 

attctaatat gtgcatacgc taaacaagag aaagtagatc aggcaatgct tgtattcagc 1200 

aaaatgaggc agcatggatt gaatccgaat gtagtgtgct atggaacagt tatagatgta 1260 

ctttgcaagt caggcagtgt agatgatgct atgctttatt ttgagcagat gatcgatgaa 1320 

ggactaaccc ctaacattat tgtgtatacc tccctaattc atggtctgtg cacctgtgac 1380 

aaatgggaca aggctgaaga gttaattctt gaaatgttgg atcgaggcat ctgtctgaac 1440 

actattttct ttaattcaat aattgacagt cattgcaaag aagggagggt tatagaatct 1500 

gaaaaactct ttgacttgat ggtacgaatt ggtgtgaagc ccgatatcat tacgtacaat 1560 

acactcatcg atggatgctg cttagctggt aagatggatg aagcaacgaa gttacttgcc 1620 

agcatggtct cagttggggt gaaacctgat attgttacct atggcacctt gattaatggc 1680 

tactgtagag ttagcaggat ggatgacgca ttagctcttt tcaaagagat ggtgagcagt 1740 

ggtgttagtc ctaatattat tacgtataac ataattctgc aaggtttatt tcataccaga 1800 

agaactgctg ctgcaaaaga actctatgtc agtattacca aaagtggaac acagcttgaa 1860 

cttagcacgt acaacataat ccttcatgga ctttgcaaaa acaatctcac tgacgaggca 1920 

cttcgaatgt ttcagaacct atgtttgacg gatttacagc tggagactag gacttttaac 1980 

attatgattg gtgccttact taaatgtgga agaatggatg aagctaagga tttgtttgct 2040 

gctcactcgg ctaacggttt agtgccagat gttaggacct acagtttaat ggcagaaaat 2100 

cttatagagc aggggtcgct agaagaattg gatgatctat ttctttcaat ggaggagaat 2160 
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ggctgttccg ccgactcccg catgctaaat tccattgtta ggaaactgtt acagaggggt 2220 
gatataacca gggctggcac ttacctgttc atgattgatg agaagcactt ctccctcgaa 2280 
gcatccactg cttccttctt gttagaatct tccccaatcg tctgggagca aatatcaaga 2340 
atatcacact tgtctgtaaa tttgaaatta attaagcagc ccaaatgcac ctgtgagtta 2400 
ggcccaaagt ggtcccaaaa tctgcctaaa cctggcacaa attcggtcgg tagtgtcgca 24 60 
cagtttcact tatcgcgcgg cggttatcgc gcttaccgcg ggggtacgac ggttaccgca 2520 
ctaccgcagg gtgacggtaa ccccggccca aacgataagg taaaccctgg tcgcacaaat 2580 
ttggcccaaa accgaccagt tatcgcgcta ccgcgggatg cctcagtagg accttag 2637 



<210> 21 
<211> 878 
<212> PRT 

<213> Oryza sativa 
<400> 21 

Met Ala Arg Arg Val Pro Thr Arg Pro Arg Gly Gly Gly Gly Gly Gly 
15 10 15 

Val Pro Arg Ser Glu Gly Ser lie Gin Gly Arg Gly Gly Arg Ala Gly 
20 25 30 

Gly Ser Gly Ala Glu Asp Ala Arg His Val Phe Asp Glu Leu Leu Arg 
35 40 45 

Arg Gly Arg Gly Ala Ser lie Tyr Gly Leu Asn Arg Ala Leu Ala Asp 
50 55 60 

Val Ala Arg His Ser Pro Ala Ala Ala Val Ser Arg Tyr Asn Arg Met 
65 70 75 80 

Ala Arg Ala Gly Ala Gly Lys Val Thr Pro Thr Val His Thr Tyr Ala 
85 90 95 

lie Leu lie Gly Cys Cys Cys Arg Ala Gly Arg Leu Asp Leu Gly Phe 
100 105 110 

Ala Ala Leu Gly Asn Val Val Lys Lys Gly Phe Arg Val Asp Ala lie 
115 120 125 

Thr Phe Thr Pro Leu Leu Lys Gly Leu Cys Ala Asp Lys Arg Thr Ser 
130 135 140 

Asp Ala Met Asp lie Val Leu Arg Arg Met Thr Glu Leu Gly Cys lie 
145 150 155 160 

Pro Asp Val Phe Ser Tyr Asn Asn Leu Leu Lys Gly Leu Cys Asp Glu 
165 170 175 
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Asn Arg Ser Gin Glu Ala Leu Glu Leu Leu His Met Met Ala Asp Asp 
180 185 190 

Arg Gly Gly Gly Ser Pro Pro Asp Val Val Ser Tyr Asn Thr Val Leu 
195 200 205 

Asn Gly Phe Phe Lys Glu Gly Asp Ser Asp Lys Ala Tyr Ser Thr Tyr 
210 215 220 

His Glu Met Leu Asp Arg Gly He Leu Pro Asp Val Val Thr Tyr Ser 
225 230 235 240 

Ser He He Ala Ala Leu Cys Lys Ala Gin Ala Met Asp Lys Ala Met 
245 250 255 

Glu Val Leu Asn Thr Met Val Lys Asn Gly Val Met Pro Asp Cys Met 
260 265 270 

Thr Tyr Asn Ser He Leu His Gly Tyr Cys Ser Ser Gly Gin Pro Lys 
275 280 285 

Glu Ala He Gly Thr Leu Lys Lys Met Arg Ser Asp Gly Val Glu Pro 
290 295 300 

Asn Val Val Thr Tyr Ser Ser Leu Met Asn Tyr Leu Cys Lys Asn Gly 
305 310 315 320 

Arg Ser Thr Glu Ala Arg Lys He Phe Asp Ser Met Thr Lys Arg Gly 
325 330 335 

Leu Glu Pro Asp He Ala Thr Tyr Arg Thr Leu Leu Gin Gly Tyr Ala 
340 345 350 

Thr Lys Gly Ala Leu Val Glu Met His Ala Leu Leu Asp Leu Met Val 
355 360 365 

Arg Asn Gly He Gin Pro Asp His His Val Phe Asn He Leu He Cys 
370 375 380 

Ala Tyr Ala Lys Gin Glu Lys Val Asp Gin Ala Met Leu Val Phe Ser 
385 390 395 400 

Lys Met Arg Gin His Gly Leu Asn Pro Asn Val Val Cys Tyr Gly Thr 
405 410 415 

Val He Asp Val Leu Cys Lys Ser Gly Ser Val Asp Asp Ala Met Leu 
420 425 430 
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Tyr Phe Glu Gin Met lie Asp Glu Gly Leu Thr Pro Asn He He Val 
435 440 445 

Tyr Thr Ser Leu He His Gly Leu Cys Thr Cys Asp Lys Trp Asp Lys 
450 455 460 

Ala Glu Glu Leu He Leu Glu Met Leu Asp Arg Gly He Cys Leu Asn 
465 470 475 480 

Thr He Phe Phe Asn Ser He He Asp Ser His Cys Lys Glu Gly Arg 
485 490 495 

Val He Glu Ser Glu Lys Leu Phe Asp Leu Met Val Arg He Gly Val 
500 505 510 

Lys Pro Asp He He Thr Tyr Asn Thr Leu He Asp Gly Cys Cys Leu 
515 520 525 

Ala Gly Lys Met Asp Glu Ala Thr Lys Leu Leu Ala Ser Met Val Ser 
530 535 540 

Val Gly Val Lys Pro Asp He Val Thr Tyr Gly Thr Leu He Asn Gly 
545 550 555 560 

Tyr Cys Arg Val Ser Arg Met Asp Asp Ala Leu Ala Leu Phe Lys Glu 
565 570 575 

Met Val Ser Ser Gly Val Ser Pro Asn He He Thr Tyr Asn He He 
580 585 590 

Leu Gin Gly Leu Phe His Thr Arg Arg Thr Ala Ala Ala Lys Glu Leu 
595 600 605 

Tyr Val Ser He Thr Lys Ser Gly Thr Gin Leu Glu Leu Ser Thr Tyr 
610 615 620 

Asn He lie Leu His Gly Leu Cys Lys Asn Asn Leu Thr Asp Glu Ala 
625 630 635 640 

Leu Arg Met Phe Gin Asn Leu Cys Leu Thr Asp Leu Gin Leu Glu Thr 
645 650 655 

Arg Thr Phe Asn He Met He Gly Ala Leu Leu Lys Cys Gly Arg Met 
660 665 670 

Asp Glu Ala Lys Asp Leu Phe Ala Ala His Ser Ala Asn Gly Leu Val 
675 680 685* 
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Pro Asp Val Arg Thr Tyr Ser Leu Met Ala Glu Asn Leu lie Glu Gin 
690 695 700 

Gly Ser Leu Glu Glu Leu Asp Asp Leu Phe Leu Ser Met Glu Glu Asn 
705 710 715 720 

Gly Cys Ser Ala Asp Ser Arg Met Leu Asn Ser lie Val Arg Lys Leu 
725 730 735 

Leu Gin Arg Gly Asp lie Thr Arg Ala Gly Thr Tyr Leu Phe Met lie 
740 745 750 

Asp Glu Lys His Phe Ser Leu Glu Ala Ser Thr Ala Ser Phe Leu Leu 
755 760 765 

Glu Ser Ser Pro lie Val Trp Glu Gin lie Ser Arg lie Ser His Leu 
770 775 780 

Ser Val Asn Leu Lys Leu lie Lys Gin Pro Lys Cys Thr Cys Glu Leu 
785 790 795 800 

Gly Pro Lys Trp Ser Gin Asn Leu Pro Lys Pro Gly Thr Asn Ser Val 
805 810 815 

Gly Ser Val Ala Gin Phe His Leu Ser Arg Gly Gly Tyr Arg Ala Tyr 
820 825 830 

Arg Gly Gly Thr Thr Val Thr Ala Leu Pro Gin Gly Asp Gly Asn Pro 
835 840 845 

Gly Pro Asn Asp Lys Val Asn Pro Gly Arg Thr Asn Leu Ala Gin Asn 
850 855 860 

Arg Pro Val lie Ala Leu Pro Arg Asp Ala Ser Val Gly Pro 
865 870 875 



<210> 22 

<211> 3660 

<212> DNA 

<213> Oryza sativa 

<400> 22 

atggcgcgcc gcgccgcttc ccgcgtccgc 
ggctcgaccc aagggcgagg gggccgcacg 
gtgttcgacg aattgctccg gcgtggcagg 
ctcgccgacg tcgcgcgtca cagccccgcg 



gccggcgctg ttggcgccct tcgctcggag 60 
gggggcagtg gcgccgagga cgcacgccac 120 
ggcgcctcga tctacggctt gaactgcgcc 180 
gccgccgtgt cccgctacaa ccgcatggcc 240 
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cgagccggcg ccgacgaggt aactcccaac 
tgctgctgcg cgggccgctt ggacctcggt 
ggatttagag tggacgccat cgccttcact 
aggacgagcg acgcaatgga catagtgctc 
aatgtcttct cctacaatat tcttctcaag 
gctctcgagc tgctccaaat gatgcctgat 
tcgtatacca ctgtcatcaa tggcttcttc 
acataccatg aaatgctgga ccgggggatt 
attgctgcgt tatgcaaggc tcaagctatg 
gttaagaatg gtgtcatgcc taattgcagg 
tcttcagggc agccgaaaga ggctattgga 
gaaccagatg ttgttactta taactcgctc 
acggaagcta gaaagatgtt cgattctatg 
acctatggta ccctgcttca ggggtatgct 
ctcttggatt tgatggtacg aaacggtatc 
atatgtgcat acgctaaaca agggaaagta 
aggcagcaag gattgaatcc ggatacagtg 
aagtcaggca gagtagaaga tgctatgcgt 
agccctggca acattgttta taactcccta 
gacaaggcta aagagttaat tcttgaaatg 
ttctttaatt caataattga cagtcattgc 
ctctttgacc tgatggtacg tattggtgtg 
atcgatggat attgcttggc aggtaagatg 
gtctcagttg gaatgaaacc tgattgtgtt 
aaaattagca ggatggaaga tgcgttagtt 
agtcctgata ttattacgta taatataatt 
gctgctgcaa aagaactcta tgtcgggatt 
acatacaaca taatccttca tgggctttgc 
atgtttcaga acctatgttt gacggattta 
attggtgcat tgcttaaagt tggcagaaat 
tcggctaacg gtttagtgcc agatgttagg 
gagcaggggt tgctagaaga attggatgat 
actgccaact cccgcatgct aaattccatt 
accagggctg gcacttacct gttcatgatt 
actgcttcct tgtttttaga tcttttgtct 
attagaggag ggatcttctc tttatgtgta 
gattcagaat ctggtgtcca ttttcttctt 
gattccaaag caccatcgat aggttccaaa 
aatggatcaa ataaggattc tgactgcatt 
attctcttaa gctgtcaagt gatcaagtcg 
ccagaactgt gtccaaagtg gcttgctttg 
gtttcagtag aggagacttc tgacacagtg 
ttaagggaga tgggaggtct tgatagtatt 
ttggagaatc tcataaagga tacatccact 
ttgcaaagtg ctgctctcct cttgaaatgt 
agcgatgata acaagaccca tttgcttaat 
ttgctttctt ttgttggtgt cattatcaat 
cttcagaatt cttctgttgt ttccagctct 



ttgtgcacct acggcattct catcggttcc 300 
ttcgcggcct tgggcaatgt cattaagaag 360 
cctctgctca agggcctctg tgctgacaag 420 
cgcagaatga cccagcttgg ctgcatacca 480 
gggctgtgtg atgagaacag aagccaagaa 540 
gatggaggtg actgcccacc tgatgtggtg 600 
aaggaggggg atctggacaa agcttacggt 660 
ttaccaaatg ttgttaccta cagctctatt 720 
gacaaagcca tggaggtact taccagcatg 780 
acgtataata gtatcgtgca tgggtattgc 840 
tttctcaaaa agatgcacag tgatggtgtc 900 
atggattatc tttgcaagaa cggaagatgc 960 
accaagaggg gcctaaagcc tgaaattact 1020 
accaaaggag cccttgttga gatgcatggt 1080 
caccctaatc attatgtttt cagcattcta 1140 
gatcaggcaa tgcttgtgtt cagcaaaatg 1200 
acctatggaa cagttatagg catactttgc 1260 
tattttgagc agatgatcga tgaaagacta 1320 
attcatagtc tctgtatctt tgacaaatgg 1380 
ttggatcgag gcatctgtct ggacactatt 1440 
aaagaaggga gggttataga atctgaaaaa 1500 
aagcccgata tcattacgta cagtactctc 1560 
gatgaagcaa cgaagttact tgccagcatg 1620 
acatataata ctttgattaa tggctactgt 1680 
ctttttaggg agatggagag cagtggtgtt 1740 
ctgcaaggtt tatttcaaac cagaagaact 1800 
accgaaagtg gaacgcagct tgaacttagc 1860 
aaaaacaatc tcactgacga ggcacttcga 1920 
cagctggaga ctaggacttt taacattatg 1980 
gatgaagcca aggatttgtt tgcagctctc 2040 
acctacagtt taatggcaga aaatcttata 2100 
ctatttcttt caatggagga gaatggctgt 2160 
gttaggaaac tgttacagag gggtgatata 2220 
gatgagaagc acttctccct cgaagcatcc 2280 
gggggaaaat atcaagaata tcatagttgt 2340 
aatagcgagg ttcaagaaaa tcatttgttg 2400 
aaattattaa atcctccagt gaatcttgtt 24 60 
cttcttggaa tcagtaaagt tcaaatgctt 2520 
tcagaggaaa tcctttcaaa agttgaagag 2580 
ctcgacaaag atgacaagaa aacaacaagg 2640 
ttgacaatgg aaaatgcatg cttgtctgct 2700 
tccagagttg gaggaaattt taaagagaca 2760 
tttgacgtta tggtggattt tcattcaaca 2820 
tcagctttgg accgaaatga aggaacatct 2880 
ttgaaaatat tggaaaatgc catatttcta 2940 
atgagtagaa aattgaaccc gaaacgctcc 3000 
actattgagt tattatcagc tctttcaata 3060 
acatatccga aatcgtctaa agtctctcaa 3120 
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cagagttact ctgtggtgat ggcggggggc gaccgtggcc gaggcgtgga gtgccatccg 3180 
catcagggtg tatcggccgc gctgctccgc cctggtccgc aggctttggc ggcgagctgg 3240 
cggcggaggg agactgtggt gagatcggat ttcgccgctg gtggtgtcgc taccatgggg 3300 
gattcgccgc aggcgctctc agatcggtta tgcgggagcg caacaaaagt atggcgtggc 3360 
ggcgcggagt ggacggccga ggcgttcgcg cggaatgggg ctgcgggacc gagccagtct 3420 
cgcttgccgg taacgcggaa ccgagctcag cactacattg caaagatttg ggcaactctg 3480 
acaatttcca tgttctacaa gcttgacgtc gagggaatgg agaacctgcc accgaatagt 3540 
agccctgcta tctatgttgc gaaccatcag agttttttgg atatctatac ccttctaact 3600 
ctaggaaggt gtttcaagtt tataagcaag acaagtatat ttatgttccg aattatttga 3660 



<210> 23 
<211> 1219 
<212> PRT 

<213> Oryza sativa 

<400> 23 
Met Ala Arg Arg 
1 

Leu Arg Ser Glu 
20 

Ser Gly Ala Glu 
35 

Gly Arg Gly Ala 
50 

Ala Arg His Ser 
65 

Arg Ala Gly Ala 



Leu lie Gly Ser 
100 

Ala Leu Gly Asn 

115 

Phe Thr Pro Leu 
130 

Ala Met Asp lie 
145 

Asn Val Phe Ser 



Ala Ala Ser Arg 
5 

Gly Ser Thr Gin 



Asp Ala Arg His 
40 

Ser lie Tyr Gly 
55 

Pro Ala Ala Ala 
70 

Asp Glu Val Thr 
85 

Cys Cys Cys Ala 



Val lie Lys Lys 
120 

Leu Lys Gly Leu 
135 

Val Leu Arg Arg 
150 

Tyr Asn lie Leu 



Val Arg Ala Gly 
10 

Gly Arg Gly Gly 

25 

Val Phe Asp Glu 



Leu Asn Cys Ala 
60 

Val Ser Arg Tyr 
75 

Pro Asn Leu Cys 
90 

Gly Arg Leu Asp 
105 

Gly Phe Arg Val 



Cys Ala Asp Lys 
140 

Met Thr Gin Leu 
155 

Leu Lys Gly Leu 



Ala Val Gly Ala 
15 

Arg Thr Gly Gly 
30 

Leu Leu Arg Arg 
45 

Leu Ala Asp Val 



Asn Arg Met Ala 
80 

Thr Tyr Gly lie 
95 

Leu Gly Phe Ala 
110 

Asp Ala lie Ala 
125 

Arg Thr Ser Asp 



Gly Cys lie Pro 
160 

Cys Asp Glu Asn 
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165 170 175 

Arg Ser Gin Glu Ala Leu Glu Leu Leu Gin Met Met Pro Asp Asp Gly 
180 185 190 

Gly Asp Cys Pro Pro Asp Val Val Ser Tyr Thr Thr Val lie Asn Gly 
195 200 205 

Phe Phe Lys Glu Gly Asp Leu Asp Lys Ala Tyr Gly Thr Tyr His Glu 
210 215 220 

Met Leu Asp Arg Gly lie Leu Pro Asn Val Val Thr Tyr Ser Ser lie 
225 230 235 240 

He Ala Ala Leu Cys Lys Ala Gin Ala Met Asp Lys Ala Met Glu Val 
245 250 255 

Leu Thr Ser Met Val Lys Asn Gly Val Met Pro Asn Cys Arg Thr Tyr 
260 265 270 

Asn Ser He Val His Gly Tyr Cys Ser Ser Gly Gin Pro Lys Glu Ala 
275 280 285 

He Gly Phe Leu Lys Lys Met His Ser Asp Gly Val Glu Pro Asp Val 
290 295 300 

Val Thr Tyr Asn Ser Leu Met Asp Tyr Leu Cys Lys Asn Gly Arg Cys 
305 310 315 320 

Thr Glu Ala Arg Lys Met Phe Asp Ser Met Thr Lys Arg Gly Leu Lys 
325 330 335 

Pro Glu He Thr Thr Tyr Gly Thr Leu Leu Gin Gly Tyr Ala Thr Lys 
340 345 350 

Gly Ala Leu Val Glu Met His Gly Leu Leu Asp Leu Met Val Arg Asn 
355 360 365 

Gly He His Pro Asn His Tyr Val Phe Ser He Leu He Cys Ala Tyr 
370 375 380 

Ala Lys Gin Gly Lys Val Asp Gin Ala Met Leu Val Phe Ser Lys Met 
385 390 395 400 

Arg Gin Gin Gly Leu Asn Pro Asp Thr Val Thr Tyr Gly Thr Val He 
405 410 415 

Gly He Leu Cys Lys Ser Gly Arg Val Glu Asp Ala Met Arg Tyr Phe 
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420 425 430 

Glu Gin Met He Asp Glu Arg Leu Ser Pro Gly Asn He Val Tyr Asn 
435 440 445 

Ser Leu lie His Ser Leu Cys lie Phe Asp Lys Trp Asp Lys Ala Lys 
450 455 460 

Glu Leu He Leu Glu Met Leu Asp Arg Gly He Cys Leu Asp Thr He 
465 470 475 480 

Phe Phe Asn Ser He He Asp Ser His Cys Lys Glu Gly Arg Val He 
485 490 495 

Glu Ser Glu Lys Leu Phe Asp Leu Met Val Arg He Gly Val Lys Pro 
500 505 510 

Asp He He Thr Tyr Ser Thr Leu He Asp Gly Tyr Cys Leu Ala Gly 
515 520 525 

Lys Met Asp Glu Ala Thr Lys Leu Leu Ala Ser Met Val Ser Val Gly 
530 535 540 

Met Lys Pro Asp Cys Val Thr Tyr Asn Thr Leu He Asn Gly Tyr Cys 
545 550 555 560 

Lys He Ser Arg Met Glu Asp Ala Leu Val Leu Phe Arg Glu Met Glu 
565 570 575 

Ser Ser Gly Val Ser Pro Asp He He Thr Tyr Asn He He Leu Gin 
580 585 590 

Gly Leu Phe Gin Thr Arg Arg Thr Ala Ala Ala Lys Glu Leu Tyr Val 
595 600 605 

Gly He Thr Glu Ser Gly Thr Gin Leu Glu Leu Ser Thr Tyr Asn He 
610 615 620 

He Leu His Gly Leu Cys Lys Asn Asn Leu Thr Asp Glu Ala Leu Arg 
625 630 635 640 

Met Phe Gin Asn Leu Cys Leu Thr Asp Leu Gin Leu Glu Thr Arg Thr 
645 650 655 

Phe Asn He Met He Gly Ala Leu Leu Lys Val Gly Arg Asn Asp Glu 
660 665 670 

Ala Lys Asp Leu Phe Ala Ala Leu Ser Ala Asn Gly Leu Val Pro Asp 
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675 680 685 

Val Arg Thr Tyr Ser Leu Met Ala Glu Asn Leu He Glu Gin Gly Leu 
690 695 700 

Leu Glu Glu Leu Asp Asp Leu Phe Leu Ser Met Glu Glu Asn Gly Cys 
705 710 715 720 

Thr Ala Asn Ser Arg Met Leu Asn Ser He Val Arg Lys Leu Leu Gin 
725 730 735 

Arg Gly Asp He Thr Arg Ala Gly Thr Tyr Leu Phe Met lie Asp Glu 
740 745 750 

Lys His Phe Ser Leu Glu Ala Ser Thr Ala Scr Leu Phe Leu Asp Leu 
755 760 765 

Leu Ser Gly Gly Lys Tyr Gin Glu Tyr His Ser Cys lie Arg Gly Gly 
770 775 780 

He Phe Ser Leu Cys Val Asn Ser Glu Val Gin Glu Asn His Leu Leu 
785 790 795 800 

Asp Ser Glu Ser Gly Val His Phe Leu Leu Lys Leu Leu Asn Pro Pro 
805 810 815 

Val Asn Leu Val Asp Ser Lys Ala Pro Ser He Gly Ser Lys Leu Leu 
820 825 830 

Gly He Ser Lys Val Gin Met Leu Asn Gly Ser Asn Lys Asp Ser Asp 
835 840 845 

Cys He Ser Glu Glu lie Leu Ser Lys Val Glu Glu He Leu Leu Ser 
850 855 860 

Cys Gin Val He Lys Ser Leu Asp Lys Asp Asp Lys Lys Thr Thr Arg 
865 870 875 880 

Pro Glu Leu Cys Pro Lys Trp Leu Ala Leu Leu Thr Met Glu Asn Ala 
885 890 895 

Cys Leu Ser Ala Val Ser Val Glu Glu Thr Ser Asp Thr Val Ser Arg 
900 905 910 

Val Gly Gly Asn Phe Lys Glu Thr Leu Arg Glu Met Gly Gly Leu Asp 
915 920 925 

Ser He Phe Asp Val Met Val Asp Phe His Ser Thr Leu Glu Asn Leu 
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930 935 940 

lie Lys Asp Thr Ser Thr Ser Ala Leu Asp Arg Asn Glu Gly Thr Ser 
945 950 955 960 

Leu Gin Ser Ala Ala Leu Leu Leu Lys Cys Leu Lys lie Leu Glu Asn 
965 970 975 

Ala lie Phe Leu Ser Asp Asp Asn Lys Thr His Leu Leu Asn Met Ser 
980 985 990 

Arg Lys Leu Asn Pro Lys Arg Ser Leu Leu Ser Phe Val Gly Val lie 
995 1000 1005 

lie Asn Thr lie Glu Leu Leu Ser Ala Leu Ser lie Leu Gin Asn Ser 
1010 1015 1020 

Ser Val Val Ser Ser Ser Thr Tyr Pro Lys Ser Ser Lys Val Ser Gin 
1025 1030 1035 1040 

Gin Ser Tyr Ser Val Val Met Ala Gly Gly Asp Arg Gly Arg Gly Val 
1045 1050 1055 

Glu Cys His Pro His Gin Gly Val Ser Ala Ala Leu Leu Arg Pro Gly 
1060 1065 1070 

Pro Gin Ala Leu Ala Ala Ser Trp Arg Arg Arg Glu Thr Val Val Arg 
1075 1080 1085 

Ser Asp Phe Ala Ala Gly Gly Val Ala Thr Met Gly Asp Ser Pro Gin 
1090 1095 1100 

Ala Leu Ser Asp Arg Leu Cys Gly Ser Ala Thr Lys Val Trp Arg Gly 
1105 1110 1115 1120 

Gly Ala Glu Trp Thr Ala Glu Ala Phe Ala Arg Asn Gly Ala Ala Gly 
1125 1130 1135 

Pro Ser Gin Ser Arg Leu Pro Val Thr Arg Asn Arg Ala Gin His Tyr 
1140 1145 1150 

lie Ala Lys lie Trp Ala Thr Leu Thr lie Ser Met Phe Tyr Lys Leu 
1155 1160 1165 

Asp Val Glu Gly Met Glu Asn Leu Pro Pro Asn Ser Ser Pro Ala lie 
1170 1175 1180 

Tyr Val Ala Asn His Gin Ser Phe Leu Asp lie Tyr Thr Leu Leu Thr 
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1185 1190 1195 1200 

Leu Gly Arg Cys Phe Lys Phe lie Ser Lys Thr Ser lie Phe Met Phe 
1205 1210 1215 

Arg lie lie 



<210> 24 

<211> 2859 

<212> DNA 

<213> Oryza sativa 



<400> 24 

atggcgcgcc gcgccgcttc ccgcgctgtt 
gggcgaggag gccgcgcggg gggcagtggc 
ttgctccggc gtggcagggg cgcctcgatc 
gcgcgtcaca gccccgcggc cgccgtgtcc 
gacgaggtaa ctcccgactt gtgcacctac 
ggccgcttgg acctcggttt cgcggccttg 
gaagccatca ccttcactcc tctgctcaag 
gcaatggaca tagtgctccg cagaatgacc 
tacaataatc ttctcaacgg gctgtgtgat 
ctgcacatga tggctgatga tcgaggagga 
actgtcatca atggcttctt caaagagggg 
gaaatgctgg accgggggat tttacctgat 
ttatgcaagg gtcaagctat ggacaagcca 
gaatctgaaa aactctttga cctgatggta 
tacagtacac tcatcgatgg atattgcttg 
ctttctggca tggtctcagt tgggttgaaa 
aatggctact gcaaaattag taggatggaa 
agcagtggtg ttagtcctga tattattacg 
accagaagaa ctgctgctgc aaaagaactc 
attgaactta gcacatacaa cataatcctt 
gatgcacttc agatgtttca gaacctatgt 
ttcaacatta tgattgatgc attgcttaaa 
tttgttgctt tctcgtctaa cggtttagtg 
gaaaatatta taggacaggg gttgctagaa 
gacaatggct gtactgttga ctctggcatg 
agaggagtag tggtggtggt gagtggtgaa 
attctgacat gtgggatcac tgtcaatccc 
cccttctcca aaccaattgt gcagacaggt 
aatgcttctg aagaaaggtt aattgttgtt 
tctccaacaa ttgaggcgct tattttgctc 
caccagttga caacacggct tgttgtacca 
ggtggaaagg taattactga aatgagaaga 
aaagcagata aacctaagta cctgtctttt 



ggcgcccttc gctcggacgg ctcgatccaa 60 
gccgaggacg cacgccacgt gttcgacgaa 120 
tacggcttga accgcgccct cgccgacgtc 180 
cgctacaacc gcatggcccg agctggcgcc 24 0 
ggcattctca tcggttgctg ctgccgcgcg 300 
ggcaatgtca ttaagaaggg atttagagtg 360 
ggcctctgtg ccgacaagag gacgagcgac 420 
gagctcggtt gcataccaaa tgtcttctcc 480 
gagaacagaa gccaagaagc tctcgagttg 540 
ggtagcccac ctgatgtggt gtcgtatacc 600 
gattcagaca aagcttacag tacataccat 660 
gttgtgacct acagctctat tattgctgcg 720 
tggagtcatt gcaaagaagg gagggttata 780 
cgtattggtg tgaagcctga tatcattaca 840 
gcaggtaaga tggatgaagc aatgaagtta 900 
cctaatactg ttacttatag cactttgatt 960 
gacgcgttag ttctttttaa ggagatggag 1020 
tataacataa ttctgcaagg tttatttcaa 1080 
tatgtcagga ttaccgaaag tggaacgcag 1140 
catggacttt gcaaaaacaa actcactgat 1200 
ttgatggatt tgaagcttga ggctaggact 1260 
gttggcagaa atgatgaagc caaggatttg 1320 
ccgaattatt ggacgtacag gttgatggct 1380 
gaattggatc aactctttct ttcaatggag 1440 
ctaaatttca ttgttaggga actgttgcag 1500 
tctgccacca ccccaccacc aactctcaaa 1560 
ttctccaaga catgtgggat cactgtcaat 1620 
gcttgcggtc aggttaaaga agttggcaaa 1680 
tcatctcagg agattccaga tgatccagtg 1740 
catagtaaag caagtacact tgctgagaac 1800 
tcaaacaaag ttggttgtat tcttggggaa 1860 
cggactgggg ctgaaatccg agtctactca 1920 
gatgaggagc ttgtgcagca tatcagcctt 1980 
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atcttggttg atcggcatgc tggacgagca 
atatatgtgc tggtgctgaa tcgatcgatt 
actgctgcct gctgggctct agccgccatc 
ctggtttcac attggtttat aataaaggcc 
aatctcagga ccggccctgc tcatgatcct 
aaaagagagt accctagtgg aagtagcaaa 
gcagcaacta ctcgcttgcc taatggagaa 
tatatgtcct gccgttctta tcttgaccaa 
acactacaat taggcctctc gagagccggg 
agagctggaa attccaatgc ttatgattat 
gaggattacc gaagactgtc aggtctcact 
ttccaaatag ttaactggag tctgtcattg 
ttgcacgaag cccatcctgg ttcttccgag 
caagtgaaag ccgcacagag ccttctgcaa 
caggcgcccc agtcctctcg catggcccat 



catctgttgt cgcatcaact gctgactgct 2040 
gtcgtcgcgg aagtgaagaa caaccacggc 2100 
agttataacc gtacaaactt cagtgatttg 2160 
tccgttttta gtttcacgct gggccttcag 2220 
tacaccgtgt atcctgtaga gtacttctct 2280 
gttgcaccat ctgcttcata cgaaagatat 2340 
ctgccctcat ctattagtcc tggtgccgat 2400 
gtacctactg ataggtactc taatagggtt 2460 
aatagtaatg tgcaacaatt aggaatcacc 2520 
actgaggctg ctgagcagat ccatggacgt 2580 
gggtatccag gtggctcttc gaattgtgga 2640 
gtgttggtga tctctggtgc gagagtgaag 2700 
tccattgtgg agatccaggg cattccggat 2760 
ggcttcatcg gcgcaagcag caacagcagg 2820 
tatttttag 2859 



<210> 25 
<211> 952 
<212> PRT 

<213> Oryza sativa 
<400> 25 

Met Ala Arg Arg Ala Ala Ser Arg Ala Val Gly Ala Leu Arg Ser Asp 
15 10 15 

Gly Ser lie Gin Gly Arg Gly Gly Arg Ala Gly Gly Ser Gly Ala Glu 
20 25 30 

Asp Ala Arg His Val Phe Asp Glu Leu Leu Arg Arg Gly Arg Gly Ala 
35 40 45 

Ser He Tyr Gly Leu Asn Arg Ala Leu Ala Asp Val Ala Arg His Ser 
50 55 60 

Pro Ala Ala Ala Val Ser Arg Tyr Asn Arg Met Ala Arg Ala Gly Ala 
65 70 75 80 

Asp Glu Val Thr Pro Asp Leu Cys Thr Tyr Gly He Leu He Gly Cys 
85 90 95 

Cys Cys Arg Ala Gly Arg Leu Asp Leu Gly Phe Ala Ala Leu Gly Asn 
100 105 HO 

Val He Lys Lys Gly Phe Arg Val Glu Ala He Thr Phe Thr Pro Leu 
115 120 125 

Leu Lys Gly Leu Cys Ala Asp Lys Arg Thr Ser Asp Ala Met Asp He 
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130 135 140 

Val Leu Arg Arg Met Thr Glu Leu Gly Cys lie Pro Asn Val Phe Ser 
145 150 155 160 

Tyr Asn Asn Leu Leu Asn Gly Leu Cys Asp Glu Asn Arg Ser Gin Glu 
165 170 175 

Ala Leu Glu Leu Leu His Met Met Ala Asp Asp Arg Gly Gly Gly Ser 
180 185 190 

Pro Pro Asp Val Val Ser Tyr Thr Thr Val lie Asn Gly Phe Phe Lys 
195 200 205 

Glu Gly Asp Ser Asp Lys Ala Tyr Ser Thr Tyr His Glu Met Leu Asp 
210 215 220 

Arg Gly lie Leu Pro Asp Val Val Thr Tyr Ser Ser lie lie Ala Ala 
225 230 235 240 

Leu Cys Lys Gly Gin Ala Met Asp Lys Pro Trp Ser His Cys Lys Glu 
245 250 255 

Gly Arg Val lie Glu Ser Glu Lys Leu Phe Asp Leu Met Val Arg lie 
260 265 270 

Gly Val Lys Pro Asp lie lie Thr Tyr Ser Thr Leu lie Asp Gly Tyr 
275 280 285 

Cys Leu Ala Gly Lys Met Asp Glu Ala Met Lys Leu Leu Ser Gly Met 
290 295 300 

Val Ser Val Gly Leu Lys Pro Asn Thr Val Thr Tyr Ser Thr Leu lie 
305 310 315 320 

Asn Gly Tyr Cys Lys lie Ser Arg Met Glu Asp Ala Leu Val Leu Phe 
325 330 335 

Lys Glu Met Glu Ser Ser Gly Val Ser Pro Asp lie lie Thr Tyr Asn 
340 345 350 

lie lie Leu Gin Gly Leu Phe Gin Thr Arg Arg Thr Ala Ala Ala Lys 
355 360 365 

Glu Leu Tyr Val Arg lie Thr Glu Ser Gly Thr Gin lie Glu Leu Ser 
370 375 380 

Thr Tyr Asn lie lie Leu His Gly Leu Cys Lys Asn Lys Leu Thr Asp 
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385 390 395 400 

Asp Ala Leu Gin Met Phe Gin Asn Leu Cys Leu Met Asp Leu Lys Leu 
405 410 415 

Glu Ala Arg Thr Phe Asn lie Met lie Asp Ala Leu Leu Lys Val Gly 
420 425 430 

Arg Asn Asp Glu Ala Lys Asp Leu Phe Val Ala Phe Ser Ser Asn Gly 
435 440 445 

Leu Val Pro Asn Tyr Trp Thr Tyr Arg Leu Met Ala Glu Asn lie lie 
450 455 460 

Gly Gin Gly Leu Leu Glu Glu Leu Asp Gin Leu Phe Leu Ser Met Glu 
465 470 475 480 

Asp Asn Gly Cys Thr Val Asp Ser Gly Met Leu Asn Phe lie Val Arg 
485 490 495 

Glu Leu Leu Gin Arg Gly Val Val Val Val Val Ser Gly Glu Ser Ala 
500 505 510 

Thr Thr Pro Pro Pro Thr Leu Lys He Leu Thr Cys Gly He Thr Val 
515 520 525 

Asn Pro Phe Ser Lys Thr Cys Gly He Thr Val Asn Pro Phe Ser Lys 
530 535 540 

Pro He Val Gin Thr Gly Ala Cys Gly Gin Val Lys Glu Val Gly Lys 
545 550 555 560 

Asn Ala Ser Glu Glu Arg Leu He Val Val Ser Ser Gin Glu He Pro 
565 570 575 

Asp Asp Pro Val Ser Pro Thr He Glu Ala Leu He Leu Leu His Ser 
580 585 590 

Lys Ala Ser Thr Leu Ala Glu Asn His Gin Leu Thr Thr Arg Leu Val 
595 600 605 

Val Pro Ser Asn Lys Val Gly Cys He Leu Gly Glu Gly Gly Lys Val 
610 615 620 

He Thr Glu Met Arg Arg Arg Thr Gly Ala Glu He Arg Val Tyr Ser 
625 630 635 640 

Lys Ala Asp Lys Pro Lys Tyr Leu Ser Phe Asp Glu Glu Leu Val Gin 
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645 650 655 

His lie Ser Leu lie Leu Val Asp Arg His Ala Gly Arg Ala His Leu 
660 665 670 

Leu Ser His Gin Leu Leu Thr Ala lie Tyr Val Leu Val Leu Asn Arg 
675 680 685 

Ser lie Val Val Ala Glu Val Lys Asn Asn His Gly Thr Ala Ala Cys 
690 695 700 

Trp Ala Leu Ala Ala lie Ser Tyr Asn Arg Thr Asn Phe Ser Asp Leu 
705 710 715 720 

Leu Val Ser His Trp Phe lie He Lys Ala Ser Val Phe Ser Phe Thr 
725 730 735 

Leu Gly Leu Gin Asn Leu Arg Thr Gly Pro Ala His Asp Pro Tyr Thr 
740 745 750 

Val Tyr Pro Val Glu Tyr Phe Ser Lys Arg Glu Tyr Pro Ser Gly Ser 
755 760 765 

Ser Lys Val Ala Pro Ser Ala Ser Tyr Glu Arg Tyr Ala Ala Thr Thr 
770 775 780 

Arg Leu Pro Asn Gly Glu Leu Pro Ser Ser He Ser Pro Gly Ala Asp 
785 790 795 800 

Tyr Met Ser Cys Arg Ser Tyr Leu Asp Gin Val Pro Thr Asp Arg Tyr 
805 810 815 

Ser Asn Arg Val Thr Leu Gin Leu Gly Leu Ser Arg Ala Gly Asn Ser 
820 825 830 

Asn Val Gin Gin Leu Gly He Thr Arg Ala Gly Asn Ser Asn Ala Tyr 
835 840 845 

Asp Tyr Thr Glu Ala Ala Glu Gin He His Gly Arg Glu Asp Tyr Arg 
850 855 860 

Arg Leu Ser Gly Leu Thr Gly Tyr Pro Gly Gly Ser Ser Asn Cys Gly 
865 870 875 880 

Phe Gin He Val Asn Trp Ser Leu Ser Leu Val Leu Val He Ser Gly 
885 890 895 

Ala Arg Val Lys Leu His Glu Ala His Pro Gly Ser Ser Glu Ser lie 
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900 905 910 

Val Glu lie Gin Gly lie Pro Asp Gin Val Lys Ala Ala Gin Ser Leu 
915 920 925 

Leu Gin Gly Phe lie Gly Ala Ser Ser Asn Ser Arg Gin Ala Pro Gin 
930 935 940 

Ser Ser Arg Met Ala His Tyr Phe 
945 950 



<210> 26 

<211> 1737 

<212> DNA 

<213> Oryza sativa 

<400> 26 

atgccgctcg ccacgctgct cggccacctc gccgccggcc gcttcggcct cgtgcaggcg 60 

ctcaccggcg ccgcgaccgc ggcggccgcg caccgactcc tccacctcct cctccgcaca 120 

gcgccgccgc ctcccctccc ggacctcgtc tccctcgcgc ggtggtcgcg cgcccacttc 180 

cgcgcgccgc tcccgctccg gctccacggg ctcctcctcg cccgcctcgc ctccaagggg 240 

ctctaccccc tcctccgctc cgagctccac gtcctcgccg cggcgcgcct ccactccccc 300 

gcatccatcc tccgcgctct cccctccccg tccgcgtccg cgtccgcatc cacgccgctc 360 

atcgccgaca tgctcgtcct cgccctcgcc agggcatccc agcccctcag ggcgtacgac 420 

gcgttcctcc tcgccgggga gagccacccg cggcaccgcc cctccacctc ctccgtgaac 4 80 

gcccttctcg ccggcctcgt cggcgccaag cgggtcgacc tcgccgagaa ggcgttcagg 540 

agcgcgctgc ggcggcgcgt gtcaccggac atctacacct tcaacaccgt catctccggc 600 

ctctgcagga tcggccagct ccgcaaagcc ggcgatgtcg ccaaggacat caaggcatgg 660 

ggtctggctc cctctgtggc cacctacaat agcctcatcg atgggtactg caagaagggt 720 

ggagctggga acatgtacca tgtcgacatg cttttgaagg agatggtcga agccgggatc 780 

tcaccgactg cagttacatt tggtgtgttg atcaatgggt attgcaagaa ctcgaatact 840 

gcggccgcag tgagagtctt cgaggagatg aagcagcagg ggatcgctgc gagtgtcgtg 900 

acgtataatt cgctaatttc aggtctctgc agtgagggta aggtggagga aggggtgaag 960 

ctgatggagg agatggagga tttggggctg tcacccaatg aaatcacctt tggctgtgtt 1020 

ctgaaagggt tttgtaagaa gggaatgatg gcagatgcca atgattggat tgatggtatg 1080 

acagagagga atgtggaacc tgatgtggtt atttacaata tcttgatcga tgtgtatcgc 1140 

cgtcttggaa aaatggagga tgcaatggcg gtgaaggagg caatggcaaa gaaggggatc 1200 

agtcccaatg tcacaacata taattgcttg ataacagggt ttagccgcag tggggattgg 1260 

aggagtgctt ctggccttct ggatgagatg aaggagaaag gtattgaagc agacgtcgtc 1320 

acttacaatg tgcttattgg tgctttgtgc tgcaaaggtg aggtacggaa agctgtaaag 1380 

ctcttggatg aaatgtcgga agttggattg gaaccaaacc atctgaccta caataccata 1440 

atacaggggt tctgtgataa gggtaacatt aagtctgcct atgaaattag aaccaggatg 1500 

gaaaaatgtc ggaaacgggc aaatgtggtt acgtacaatg tgttcatcaa gtatttctgc 1560 

cagataggga agatggatga agctaatgat ctactcaatg agatgttgga caaatgtcta 1620 

gttccaaacg ggatcactta tgaaacgata aaagagggga tgatggaaaa aggctataca 1680 

ccagatatta gagggtgcac tgtctcacaa gcttctgaaa acccagcatc atcctga 1737 
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<210> 27 
<211> 578 
<212> PRT 

<213> Oryza sativa 
<400> 27 

Met Pro Leu Ala Thr Leu Leu Gly His Leu Ala Ala Gly Arg Phe Gly 
15 10 15 

Leu Val Gin Ala Leu Thr Gly Ala Ala Thr Ala Ala Ala Ala His Arg 
20 25 30 

Leu Leu His Leu Leu Leu Arg Thr Ala Pro Pro Pro Pro Leu Pro Asp 
35 40 45 

Leu Val Ser Leu Ala Arg Trp Ser Arg Ala His Phe Arg Ala Pro Leu 
50 55 60 

Pro Leu Arg Leu His Gly Leu Leu Leu Ala Arg Leu Ala Ser Lys Gly 
65 70 75 80 

Leu Tyr Pro Leu Leu Arg Ser Glu Leu His Val Leu Ala Ala Ala Arg 
85 90 95 

Leu His Ser Pro Ala Ser lie Leu Arg Ala Leu Pro Ser Pro Ser Ala 
100 105 HO 

Ser Ala Ser Ala Ser Thr Pro Leu lie Ala Asp Met Leu Val Leu Ala 
115 120 125 

Leu Ala Arg Ala Ser Gin Pro Leu Arg Ala Tyr Asp Ala Phe Leu Leu 
130 135 140 

Ala Gly Glu Ser His Pro Arg His Arg Pro Ser Thr Ser Ser Val Asn 
145 150 155 160 

Ala Leu Leu Ala Gly Leu Val Gly Ala Lys Arg Val Asp Leu Ala Glu 
165 170 175 

Lys Ala Phe Arg Ser Ala Leu Arg Arg Arg Val Ser Pro Asp He Tyr 
180 185 190 

Thr Phe Asn Thr Val He Ser Gly Leu Cys Arg He Gly Gin Leu Arg 
195 200 205 

Lys Ala Gly Asp Val Ala Lys Asp He Lys Ala Trp Gly Leu Ala Pro 
210 215 220 
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Ser Val Ala Thr Tyr Asn Ser Leu He Asp Gly Tyr Cys Lys Lys Gly 
225 230 235 240 

Gly Ala Gly Asn Met Tyr His Val Asp Met Leu Leu Lys Glu Met Val 
245 250 255 

Glu Ala Gly He Ser Pro Thr Ala Val Thr Phe Gly Val Leu He Asn 
260 265 270 

Gly Tyr Cys Lys Asn Ser Asn Thr Ala Ala Ala Val Arg Val Phe Glu 
275 280 285 

Glu Met Lys Gin Gin Gly He Ala Ala Ser Val Val Thr Tyr Asn Ser 
290 295 300 

Leu He Ser Gly Leu Cys Ser Glu Gly Lys Val Glu Glu Gly Val Lys 
305 310 315 320 

Leu Met Glu Glu Met Glu Asp Leu Gly Leu Ser Pro Asn Glu He Thr 
325 330 335 

Phe Gly Cys Val Leu Lys Gly Phe Cys Lys Lys Gly Met Met Ala Asp 
340 345 350 

Ala Asn Asp Trp He Asp Gly Met Thr Glu Arg Asn Val Glu Pro Asp 
355 360 365 

Val Val He Tyr Asn He Leu He Asp Val Tyr Arg Arg Leu Gly Lys 
370 375 380 

Met Glu Asp Ala Met Ala Val Lys Glu Ala Met Ala Lys Lys Gly He 
385 390 395 400 

Ser Pro Asn Val Thr Thr Tyr Asn Cys Leu He Thr Gly Phe Ser Arg 
405 410 415 

Ser Gly Asp Trp Arg Ser Ala Ser Gly Leu Leu Asp Glu Met Lys Glu 
420 425 430 

Lys Gly He Glu Ala Asp Val Val Thr Tyr Asn Val Leu He Gly Ala 
435 440 445 

Leu Cys Cys Lys Gly Glu Val Arg Lys Ala Val Lys Leu Leu Asp Glu 
450 455 460 

Met Ser Glu Val Gly Leu Glu Pro Asn His Leu Thr Tyr Asn Thr He 
465 470 475 480 
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lie Gin Gly Phe Cys 
485 

Arg Thr Arg Met Glu 
500 

Asn Val Phe lie Lys 

515 

Asn Asp Leu Leu Asn 
530 

lie Thr Tyr Glu Thr 
545 

Pro Asp lie Arg Gly 
565 

Ser Ser 



Asp Lys Gly Asn lie 
490 

Lys Cys Arg Lys Arg 
505 

Tyr Phe Cys Gin lie 
520 

Glu Met Leu Asp Lys 
535 

lie Lys Glu Gly Met 
550 

Cys Thr Val Ser Gin 
570 



Lys Ser Ala Tyr Glu lie 
495 

Ala Asn Val Val Thr Tyr 
510 

Gly Lys Met Asp Glu Ala 
525 

Cys Leu Val Pro Asn Gly 
540 

Met Glu Lys Gly Tyr Thr 
555 560 

Ala Ser Glu Asn Pro Ala 
575 



<210> 28 

<211> 1365 

<212> DNA 

<213> Oryza sativa 

<400> 28 

atggctgatg atggtcgctg cccacctgat gtggtgtcgt ataataccat cattgatggt 60 
ctcttcaaag agggtgatgt ggacaaagct tacatcacat accatgaaat gctggaccgg 120 
agggtttctc cagatgctgt gacttacaac tctatcattg ctgccttaag caaggctcaa 180 
gctatggaca gggccatgga ggtacttaca gtgatggtta tgcccaattg cttcacatat 240 
aatagtatta tgcatggata ttgttcttca ggacagtcgg aaaaggctat tggtattttc 300 
agaaagatgt gcagtgatgg tattgaacca gatgttgtta cttataactc gttgatggac 360 
tatctctgca agaacggaaa atgcacagaa gccagaaaga tttttgattc tatggtcaag 420 
aggggtctca agcctgatat tactacctat ggtaccctgc ttcatgggta tgcttccaaa 480 
ggagctcttg ttgagatgca tgatctctta gctttgatgg tacaaaatgg catgcaactt 540 
gatcatcatg tcttcaacat attaatatgt gcatacacta aacaagaaaa agtagacgag 600 
gtcgtgcttg tattcagcaa aatgaggcag caaggattga ctccgaacgc agtgaactat 660 
agaacagtga tagatggact ttgcaagtta ggtagactag atgatgctat gcttaatttt 720 
gagcagatga ttgataaagg actgacacct aacgttgttg tttatacctc cctaattcat 780 
gctctctgta cctatgacaa atgggagaag gccgaggagt taatttttga aatattggat 840 
caaggtatca atcccaacat tgtgtttttt aatacaatat tggacagtct ttgcaaagaa 900 
gggagggtta tagaatctaa aaaactcttt gacctgttgg gacatattgg tgtgaatcct 960 
gatgtcatta catacagtac actcatcgat ggatattgct tagctggtaa gatggatgga 1020 
gcaatgaagt tactcactgg catggtctca gttgggttga aacctgatag tgttacatat 1080 
agcactttga ttaatggtta ctgtaaaatt aatagaatgg aggacgcatt agctcttttc 1140 
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aaggagatgg aaagcaatgg tgttaatcct gatattatta catataacat aattctgcat 1200 
ggtttatttc gcaccagaag aactgctgct gcaaaagaac tatatgccag gattaccgaa 1260 
agtggaacgc agcttgaact tagcacatac aacataatcc tcatggactt tgcaaaaaca 1320 
aactcactga tgatgcactt cggatgtttc agaacctatg tttga 1365 



<210> 29 
<211> 454 
<212> PRT 

<213> Oryza sativa 
<400> 29 

Met Ala Asp Asp Gly Arg Cys Pro Pro Asp Val Val Ser Tyr Asn Thr 
15 10 15 

lie He Asp Gly Leu Phe Lys Glu Gly Asp Val Asp Lys Ala Tyr He 
20 25 30 

Thr Tyr His Glu Met Leu Asp Arg Arg Val Ser Pro Asp Ala Val Thr 
35 40 45 

Tyr Asn Ser He He Ala Ala Leu Ser Lys Ala Gin Ala Met Asp Arg 
50 55 60 

Ala Met Glu Val Leu Thr Val Met Val Met Pro Asn Cys Phe Thr Tyr 
65 70 75 80 

Asn Ser He Met His Gly Tyr Cys Ser Ser Gly Gin Ser Glu Lys Ala 
85 90 95 

He Gly He Phe Arg Lys Met Cys Ser Asp Gly He Glu Pro Asp Val 
100 105 HO 

Val Thr Tyr Asn Ser Leu Met Asp Tyr Leu Cys Lys Asn Gly Lys Cys 
115 120 125 

Thr Glu Ala Arg Lys He Phe Asp Ser Met Val Lys Arg Gly Leu Lys 
130 135 140 

Pro Asp He Thr Thr Tyr Gly Thr Leu Leu His Gly Tyr Ala Ser Lys 
145 150 155 160 

Gly Ala Leu Val Glu Met His Asp Leu Leu Ala Leu Met Val Gin Asn 
165 170 175 

Gly Met Gin Leu Asp His His Val Phe Asn He Leu He Cys Ala Tyr 
180 185 190 
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Thr Lys Gin Glu Lys Val Asp Glu Val Val Leu Val Phe Ser Lys Met 
195 200 205 

Arg Gin Gin Gly Leu Thr Pro Asn Ala Val Asn Tyr Arg Thr Val lie 
210 215 220 

Asp Gly Leu Cys Lys Leu Gly Arg Leu Asp Asp Ala Met Leu Asn Phe 
225 230 235 240 

Glu Gin Met lie Asp Lys Gly Leu Thr Pro Asn Val Val Val Tyr Thr 
245 250 255 

Ser Leu lie His Ala Leu Cys Thr Tyr Asp Lys Trp Glu Lys Ala Glu 

260 265 270 

Glu Leu lie Phe Glu lie Leu Asp Gin Gly lie Asn Pro Asn lie Val 
275 280 285 

Phe Phe Asn Thr lie Leu Asp Ser Leu Cys Lys Glu Gly Arg Val lie 
290 295 300 

Glu Ser Lys Lys Leu Phe Asp Leu Leu Gly His lie Gly Val Asn Pro 
305 310 315 320 

Asp Val lie Thr Tyr Ser Thr Leu lie Asp Gly Tyr Cys Leu Ala Gly 
325 330 335 

Lys Met Asp Gly Ala Met Lys Leu Leu Thr Gly Met Val Ser Val Gly 
340 345 350 

Leu Lys Pro Asp Ser Val Thr Tyr Ser Thr Leu lie Asn Gly Tyr Cys 
355 360 365 

Lys lie Asn Arg Met Glu Asp Ala Leu Ala Leu Phe Lys Glu Met Glu 
370 375 380 

Ser Asn Gly Val Asn Pro Asp lie lie Thr Tyr Asn lie lie Leu His 
385 390 395 400 

Gly Leu Phe Arg Thr Arg Arg Thr Ala Ala Ala Lys Glu Leu Tyr Ala 
405 410 415 

Arg lie Thr Glu Ser Gly Thr Gin Leu Glu Leu Ser Thr Tyr Asn lie 
420 425 430 

lie Leu Met Asp Phe Ala Lys Thr Asn Ser Leu Met Met His Phe Gly 
435 440 445 
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Cys Phe Arg Thr Tyr Val 
450 



<210> 30 

<211> 1386 

<212> DNA 

<213> Oryza sativa 

<400> 30 

atggcgcgcc gcgccgcttc ccgcgctgtt ggctcggagg gctcgatcca agggcgaggg 60 

ggccgcgcgg ggggcaatgg cgccgaggac gcacgccacg tgttcgacga attgcttcgg 120 

cgtggcaagg gcgccacgat ctacggcttg aaccgcgccc tcgacgacgt cgcgcgtcac 180 

agccccgcgg ccgccgtgtc ccgctacaac cgcatggccc gagccggcgc cgacgaggta 24 0 

actcccaact tgtacaccta cagcgttctc atcggttgct gctgccgggc gggccgcttg 300 

gacctcggtt tcgcggcctt gggcaatgtc attaagaagg gatttagagt ggaagccatc 360 

accttcactc ctctgctcaa gggcctctgt gccgacaaga ggacgagcga cgcaatggac 420 

atagtgctct gcagaatgac ccagctcggc tgcataccaa atgtcttctc ctgcaccatt 480 

cttctcaagg gtctgtgtga tgagaacaga agccaagaag ctctcgagct gctccaaatg 540 

atgcctgatg atggaggtga ctgcccacct gatgtggtgt tgtacaacac cgtcatcaat 600 

ggcttcttca aagaggggga tccggacaaa gcttacgcta cataccatga aatgtttgac 660 

caggggattt tgccagatgt tgtgacttac agctctatta tcgctgcctt atgcaaggct 720 

caagctatgg acaaggccat ggaggtactt aacaccatgg ttaagaatgg tgtcatgcct 780 

aattgcagga catataatag tattgtgcac ggatattgct cttcagggca gttgacagag 840 

gctattggat ttctcaaaat gatgtgcagt gatggtgtcg aaccagatgt tgttacttgt 900 

aacttgctga tggattatct ttgcaagaac agaagatgca cggaagctag aaagattttc 960 

aattctatga ccaagtgtgg cctaaagcct gatattacta cctattgtac cctgcttcag 1020 

gggtatgcta ccaaaggagc ccttgttgag atgcatgatc tcctggattt gatggtatgg 1080 

aacggtatcc aacctaatca tcatgtattc aacattctaa tatgtgcata cgctaaacaa 1140 

gaaaaagtag atgaggcgat gcttgtattc agcaaaatga ggcagcaagg attgagtccg 1200 

aatgcagtga actacagaac agtcatagat gtactctgca agctaggcag agtatacgat 1260 

gcagtgctta ccttaaagca gatgatcaat gaaggactaa cccctgacat cattgtatat 1320 

acccccctaa ttcatggttt ttgtacctgt gacaaatggg agaaggctga ggagttaatt 1380 

ttttaa 1386 



<210> 31 
<211> 461 
<212> PRT 

<213> Oryza sativa 
<400> 31 

Met Ala Arg Arg Ala Ala Ser Arg Ala Val Gly Ser Glu Gly Ser lie 
15 10 15 

Gin Gly Arg Gly Gly Arg Ala Gly Gly Asn Gly Ala Glu Asp Ala Arg 
20 25 30 
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His Val Phe Asp Glu Leu Leu Arg Arg Gly Lys Gly Ala Thr lie Tyr 
35 40 45 

Gly Leu Asn Arg Ala Leu Asp Asp Val Ala Arg His Ser Pro Ala Ala 
50 55 60 

Ala Val Ser Arg Tyr Asn Arg Met Ala Arg Ala Gly Ala Asp Glu Val 
65 70 75 80 

Thr Pro Asn Leu Tyr Thr Tyr Ser Val Leu lie Gly Cys Cys Cys Arg 
85 90 95 

Ala Gly Arg Leu Asp Leu Gly Phe Ala Ala Leu Gly Asn Val lie Lys 
100 105 110 

Lys Gly Phe Arg Val Glu Ala lie Thr Phe Thr Pro Leu Leu Lys Gly 
115 120 125 

Leu Cys Ala Asp Lys Arg Thr Ser Asp Ala Met Asp lie Val Leu Cys 
130 135 140 

Arg Met Thr Gin Leu Gly Cys lie Pro Asn Val Phe Ser Cys Thr lie 
145 150 155 160 

Leu Leu Lys Gly Leu Cys Asp Glu Asn Arg Ser Gin Glu Ala Leu Glu 
165 170 175 

Leu Leu Gin Met Met Pro Asp Asp Gly Gly Asp Cys Pro Pro Asp Val 
180 185 190 

Val Leu Tyr Asn Thr Val lie Asn Gly Phe Phe Lys Glu Gly Asp Pro 
195 200 205 

Asp Lys Ala Tyr Ala Thr Tyr His Glu Met Phe Asp Gin Gly lie Leu 
210 215 220 

Pro Asp Val Val Thr Tyr Ser Ser lie lie Ala Ala Leu Cys Lys Ala 
225 230 235 240 

Gin Ala Met Asp Lys Ala Met Glu Val Leu Asn Thr Met Val Lys Asn 
245 250 255 

Gly Val Met Pro Asn Cys Arg Thr Tyr Asn Ser lie Val His Gly Tyr 
260 265 270 

Cys Ser Ser Gly Gin Leu Thr Glu Ala lie Gly Phe Leu Lys Met Met 
275 280 285 
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Cys Ser Asp Gly Val Glu 
290 

Asp Tyr Leu Cys Lys Asn 
305 310 

Asn Ser Met Thr Lys Cys 
325 

Thr Leu Leu Gin Gly Tyr 
340 

Asp Leu Leu Asp Leu Met 
355 

Val Phe Asn lie Leu lie 
370 

Glu Ala Met Leu Val Phe 
385 390 

Asn Ala Val Asn Tyr Arg 
405 

Arg Val Tyr Asp Ala Val 
420 

Leu Thr Pro Asp lie lie 
435 

Thr Cys Asp Lys Trp Glu 
450 



Pro Asp Val Val Thr 
295 

Arg Arg Cys Thr Glu 
315 

Gly Leu Lys Pro Asp 
330 

Ala Thr Lys Gly Ala 
345 

Val Trp Asn Gly lie 
360 

Cys Ala Tyr Ala Lys 
375 

Ser Lys Met Arg Gin 
395 

Thr Val lie Asp Val 
410 

Leu Thr Leu Lys Gin 
425 

Val Tyr Thr Pro Leu 
440 

Lys Ala Glu Glu Leu 
455 



Cys Asn Leu Leu Met 
300 

Ala Arg Lys lie Phe 
320 

lie Thr Thr Tyr Cys 
335 

Leu Val Glu Met His 
350 

Gin Pro Asn His His 
365 

Gin Glu Lys Val Asp 
380 

Gin Gly Leu Ser Pro 
400 

Leu Cys Lys Leu Gly 
415 

Met lie Asn Glu Gly 
430 

lie His Gly Phe Cys 
445 

lie Phe 
460 



<210> 32 

<211> 1521 

<212> DNA 

<213> Oryza sativa 

<400> 32 

atggcacgcc gcgtcgctgc ccgcgcccgc gcccgcgccg gcggcgtccc gcgctcggag 60 
ggtacgatcc aagaccgagc acgcgttggg agcggtggcg ccgaggacgc actcgacgtg 120 
ttcgacgaat tgctccggcg aggcatcggc gctccgatcc gcagcttgaa cggcgctctc 180 
gccgacgtcg cgcgcgacaa ccccgcggcc gctgtgtccc gcttcaaccg catggcacga 240 
gctggtgcca gcatggtaac tcccaccgtg cacacctatg gcatcctcat cggctgctgc 300 
tgcagtgcgg gccgcttaga cctcggtttc gcggccttgg gccatgtcgt taagaaggga 360 
ttcagagtgg aacccatcat ctttaatcct ctgctcaagg gcctctgtgc agacaagagg 420 
acggacgacg caatggacat agtgctccgt ggaatgaccg agctcagctg cgtgccaaat 480 
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gtcttctccc acaccattat tctcaaggga 
ctcgagctgc tccacatgat ggctgatgat 
tacagcaccg tcatcgatgg cctcttgaaa 
taccgtgaaa tgcttgaccg gaggattttg 
gctgccctat gcaagggtca agcaatggac 
aagaatggag ttacacccaa ttgcttcacg 
tcagggcagt tgacagaggc tattaaattt 
ccaaatgttg ttacttatag ctcgtttatg 
gaagctagaa agatttttga ttctatggtc 
tacagtagct tacttcatgg gtatgctatc 
tttgatttga tggtacaaag tgatatgcaa 
tatgcatccg ccaagcaagg aaaagtagat 
cagcaaggat tgaaacctaa ttgtgttacg 
attactagga tggagaatgc tttagcactt 
cctaatttta tcacatataa cataatgctg 
actgcaaaag aattctatgt acagattatc 
gggttgctag aagaattgga tgatctattt 
gtgtcgactc ctgcatgcta a 



ctctgtcatg agaacagaag ccaagaagct 540 
ggaggaggct gcttacctaa tgttgtgtca 600 
ggaggggatc cggacaaagc ctacgctaca 660 
ccaaatgttg tgatttacag ctccattatt 720 
aaggccatgg aggtacacga taggatggtt 780 
tatactagtc ttgtgcatgg attttgctct 840 
ctagaaaaga tgtgcagcaa tggtgttgaa 900 
gactatctct gcaagaacgg aagatgcaca 960 
aagaggggcc taaagcctga tattactacc 1020 
gaaggagctc ttgttgagat gcatggtctc 1080 
cccgatcatt atgtcttcaa cacactaata 1140 
gaggccatgc ttgtatttag caaaatgagg 1200 
tatagcactt tgattaatgg ctactgtaaa 1260 
ttccaagaga tggtgagcaa tggtgttagt 1320 
caaggtttat ttcgtacagg aagaactgct 1380 
aaaagtggca aaaaagatct tatagaacag 1440 
ctttcaatgg aggacaatga ctgtagtact 1500 

1521 



<210> 33 
<211> 506 
<212> PRT 

<213> Oryza sativa 
<400> 33 

Met Ala Arg Arg Val Ala Ala Arg Ala Arg Ala Arg Ala Gly Gly Val 
15 10 15 

Pro Arg Ser Glu Gly Thr He Gin Asp Arg Ala Arg Val Gly Ser Gly 
20 25 30 

Gly Ala Glu Asp Ala Leu Asp Val Phe Asp Glu Leu Leu Arg Arg Gly 
35 40 45 

He Gly Ala Pro He Arg Ser Leu Asn Gly Ala Leu Ala Asp Val Ala 
50 55 60 

Arg Asp Asn Pro Ala Ala Ala Val Ser Arg Phe Asn Arg Met Ala Arg 
65 70 75 80 

Ala Gly Ala Ser Met Val Thr Pro Thr Val His Thr Tyr Gly He Leu 
85 90 95 

He Gly Cys Cys Cys Ser Ala Gly Arg Leu Asp Leu Gly Phe Ala Ala 
100 105 110 

Leu Gly His Val Val Lys Lys Gly Phe Arg Val Glu Pro He He Phe 
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115 120 125 

Asn Pro Leu Leu Lys Gly Leu Cys Ala Asp Lys Arg Thr Asp Asp Ala 
130 135 140 

Met Asp lie Val Leu Arg Gly Met Thr Glu Leu Ser Cys Val Pro Asn 
145 150 155 160 

Val Phe Ser His Thr lie lie Leu Lys Gly Leu Cys His Glu Asn Arg 
165 170 175 

Ser Gin Glu Ala Leu Glu Leu Leu His Met Met Ala Asp Asp Gly Gly 
180 185 190 

Gly Cys Leu Pro Asn Val Val Ser Tyr Ser Thr Val lie Asp Gly Leu 
195 200 205 

Leu Lys Gly Gly Asp Pro Asp Lys Ala Tyr Ala Thr Tyr Arg Glu Met 
210 215 220 

Leu Asp Arg Arg lie Leu Pro Asn Val Val lie Tyr Ser Ser lie lie 
225 230 235 240 

Ala Ala Leu Cys Lys Gly Gin Ala Met Asp Lys Ala Met Glu Val His 
245 250 255 

Asp Arg Met Val Lys Asn Gly Val Thr Pro Asn Cys Phe Thr Tyr Thr 
260 265 270 

Ser Leu Val His Gly Phe Cys Ser Ser Gly Gin Leu Thr Glu Ala lie 
275 280 285 

Lys Phe Leu Glu Lys Met Cys Ser Asn Gly Val Glu Pro Asn Val Val 
290 295 300 

Thr Tyr Ser Ser Phe Met Asp Tyr Leu Cys Lys Asn Gly Arg Cys Thr 
305 310 315 320 

Glu Ala Arg Lys lie Phe Asp Ser Met Val Lys Arg Gly Leu Lys Pro 
325 330 335 

Asp lie Thr Thr Tyr Ser Ser Leu Leu His Gly Tyr Ala lie Glu Gly 
340 345 350 

Ala Leu Val Glu Met His Gly Leu Phe Asp Leu Met Val Gin Ser Asp 
355 360 365 

Met Gin Pro Asp His Tyr Val Phe Asn Thr Leu lie Tyr Ala Ser Ala 

40 



WO 03/057859 



PCT/US03/00887 



370 

Lys Gin Gly Lys 
385 

Gin Gin Gly Leu 



Gly Tyr Cys Lys 
420 

Glu Met Val Ser 
435 

Met Leu Gin Gly 
450 

Phe Tyr Val Gin 
465 

Gly Leu Leu Glu 



Asp Cys Ser Thr 
500 



375 

Val Asp Glu Ala 
390 

Lys Pro Asn Cys 
405 

lie Thr Arg Met 



Asn Gly Val Ser 
440 

Leu Phe Arg Thr 
455 

lie lie Lys Ser 
470 

Glu Leu Asp Asp 
485 

Val Ser Thr Pro 



380 

Met Leu Val Phe 
395 

Val Thr Tyr Ser 
410 

Glu Asn Ala Leu 
425 

Pro Asn Phe lie 



Gly Arg Thr Ala 
460 

Gly Lys Lys Asp 
475 

Leu Phe Leu Ser 
4 90 

Ala Cys 
505 



Ser Lys Met Arg 
400 

Thr Leu lie Asn 
415 

Ala Leu Phe Gin 
430 

Thr Tyr Asn lie 
445 

Thr Ala Lys Glu 



Leu lie Glu Gin 

480 

Met Glu Asp Asn 
495 



<210> 34 
<211> 1884 
<212> DNA 

<213> Oryza sativa 
<400> 34 

atggcgcgcc gcgccgcttc ccgcgctgct ggcgcccttc gctcggaggg ctcgatccaa 60 

gggcgagggg gccgcgcggg gggcagtggc ggtggcgcgg aggacgcacg ccacgtgttc 120 

gacgaattgc tccgtcgtgg cataccagat gtcttctcct acaatattct tctcaacggg 180 

ctgtgtgatg agaacagaag ccaagaagct ctcgagttac tgcacataat ggctgatgat 240 

ggaggtgact gcccacctga tgtggtgtcg tacagcaccg tcatcaatgg cttcttcaag 300 

gagggggatc tggacaaaat gcttgaccag aggatttcgc caaatgttgt gacctacaac 360 

tctattattg ctgcgctatg caaggctcaa actgtggaca aggccatgga ggtacttacc 420 

accatggtta agagtggtgt catgcctgat tgcatgacat ataatagtat tgtgcatggg 480 

ttttgctctt cagggcagcc gaaagaggct attgtatttc tcaaaaagat gcgcagtgat 540 

ggtgtcgaac cagatgttgt tacttataac tcgctcatgg attatctttg caagaacgga 600 

agatgcacgg aagcaagaaa gatttttgat tctatgacca agaggggcct aaagcctgat 660 

attactacct atggtaccct gcttcagggg tatgctacca aaggagccct tgttgagatg 720 

catggtctct tggatttgat ggtacgaaac ggtatccacc ctaatcatta tgttttcagc 780 

attctagtat gtgcatacgc taaacaagag aaagtagaag aggcaatgct tgtattcagc 840 

aaaatgaggc agcaaggatt gaatccgaat gcagtgacct atggaacagt tatagatgta 900 
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ctttgcaagt caggtagagt agaagatgct atgctttatt ttgagcagat gatcgatgaa 960 

ggactaagac ctgacagcat tgtttataac tccctaattc atagtctctg tatctttgac 1020 

aaatgggaga aggctgaaga gttatttctt gaaatgttgg atcgaggcat ctgtcttagc 1080 

actattttct ttaattcaat aattgacagt cattgcaaag aagggagggt tatagaatct 1140 

ggaaaactct ttgacttgat ggtacgaatt ggtgtgaagc ccgatatcat tacccttggc 1200 

aggtttttgg ggagcgcaag gcgcgactac tcactgttcg tcaacatcta cttcatcttc 1260 

accaacatgt cgaacactgg agacaaggag aaggagactc ccgtcaacac caacggaggc 1320 

aatactgcct caaactccag cggaggacca ttcttgggca catacaacat aatccttcat 1380 

ggactttgca aaaacaaact cactgatgat gcacttcgaa tgtttcagaa cctatgtttg 1440 

atggatttga agcttgaggc taggactttc aacattatga ttgatgcatt gcttaaagtt 1500 

ggcagaaatg atgaagccaa ggatttgttt gttgctttct cgtctaacgg tttagtgccg 1560 

aattattgga cgtacagatt gatggctgaa aatattatag gacaggggtt gctagaagaa 1620 

ttggatcaac tctttctttc aatggaggac aatggctgta ctgttgactc tggcatgcta 1680 

aatttcattg ttagggaact gttgcagaga ggtgagataa ccagggctgg cacttacctt 1740 

tccatgattg atgagaagca cttttccctc gaagcatcca ctgcttcctt gtttatagat 1800 
cttttgtctg ggggaaaata tcaagaatat catatatttc tccctgaaaa atacaagtcc 1860 
tttatagaat ctttgagctg ctga 1884 



<210> 35 
<211> 627 
<212> PRT 

<213> Oryza sativa 
<400> 35 

Met Ala Arg Arg Ala Ala Ser Arg Ala Ala Gly Ala Leu Arg Ser Glu 
15 10 15 

Gly Ser lie Gin Gly Arg Gly Gly Arg Ala Gly Gly Ser Gly Gly Gly 
20 25 30 

Ala Glu Asp Ala Arg His Val Phe Asp Glu Leu Leu Arg Arg Gly lie 
35 40 45 

Pro Asp Val Phe Ser Tyr Asn lie Leu Leu Asn Gly Leu Cys Asp Glu 
50 55 60 

Asn Arg Ser Gin Glu Ala Leu Glu Leu Leu His lie Met Ala Asp Asp 
65 70 75 80 

Gly Gly Asp Cys Pro Pro Asp Val Val Ser Tyr Ser Thr Val He Asn 
85 90 95 

Gly Phe Phe Lys Glu Gly Asp Leu Asp . Lys Met Leu Asp Gin Arg He 
100 105 HO 

Ser Pro Asn Val Val Thr Tyr Asn Ser He He Ala Ala Leu Cys Lys 
115 120 125 
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Ala Gin Thr Val Asp 
130 

Ser Gly Val Met Pro 
145 

Phe Cys Ser Ser Gly 
165 

Met Arg Ser Asp Gly 
180 

Met Asp Tyr Leu Cys 
195 

Phe Asp Ser Met Thr 
210 

Gly Thr Leu Leu Gin 
225 

His Gly Leu Leu Asp 
245 

Tyr Val Phe Ser lie 
260 

Glu Glu Ala Met Leu 
275 

Pro Asn Ala Val Thr 
290 

Gly Arg Val Glu Asp 
305 

Gly Leu Arg Pro Asp 
325 

Cys lie Phe Asp Lys 
340 

Leu Asp Arg Gly lie 
355 

Asp Ser His Cys Lys 
370 



Lys Ala Met Glu Val Leu 
135 

Asp Cys Met Thr Tyr Asn 
150 155 

Gin Pro Lys Glu Ala lie 
170 

Val Glu Pro Asp Val Val 
185 

Lys Asn Gly Arg Cys Thr 
200 

Lys Arg Gly Leu Lys Pro 
215 

Gly Tyr Ala Thr Lys Gly 
230 235 

Leu Met Val Arg Asn Gly 
250 

Leu Val Cys Ala Tyr Ala 
265 

Val Phe Ser Lys Met Arg 
280 

Tyr Gly Thr Val lie Asp 
295 

Ala Met Leu Tyr Phe Glu 
310 315 

Ser He Val Tyr Asn Ser 
330 

Trp Glu Lys Ala Glu Glu 

345 

Cys Leu Ser Thr He Phe 
360 

Glu Gly Arg Val He Glu 
375 



Thr Thr Met Val Lys 
140 

Ser He Val His Gly 
160 

Val Phe Leu Lys Lys 
175 

Thr Tyr Asn Ser Leu 
190 

Glu Ala Arg Lys He 
205 

Asp lie Thr Thr Tyr 
220 

Ala Leu Val Glu Met 
240 

He His Pro Asn His 
255 

Lys Gin Glu Lys Val 
270 

Gin Gin Gly Leu Asn 
285 

Val Leu Cys Lys Ser 
300 

Gin Met lie Asp Glu 
320 

Leu He His Ser Leu 

335 

Leu Phe Leu Glu Met 
350 

Phe Asn Ser He He 
365 

Ser Gly Lys Leu Phe 
380 
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Asp Leu Met Val Arg lie Gly Val Lys Pro Asp He He Thr Leu Gly 
385 390 395 400 

Arg Phe Leu Gly Ser Ala Arg Arg Asp Tyr Ser Leu Phe Val Asn He 
405 410 415 

Tyr Phe lie Phe Thr Asn Met Ser Asn Thr Gly Asp Lys Glu Lys Glu 
420 425 430 

Thr Pro Val Asn Thr Asn Gly Gly Asn Thr Ala Ser Asn Ser Ser Gly 
435 440 445 

Gly Pro Phe Leu Gly Thr Tyr Asn He He Leu His Gly Leu Cys Lys 
450 455 460 

Asn Lys Leu Thr Asp Asp Ala Leu Arg Met Phe Gin Asn Leu Cys Leu 
465 470 475 480 

Met Asp Leu Lys Leu Glu Ala Arg Thr Phe Asn He Met He Asp Ala 
485 490 495 

Leu Leu Lys Val Gly Arg Asn Asp Glu Ala Lys Asp Leu Phe Val Ala 
500 505 510 

Phe Ser Ser Asn Gly Leu Val Pro Asn Tyr Trp Thr Tyr Arg Leu Met 
515 520 525 

Ala Glu Asn He He Gly Gin Gly Leu Leu Glu Glu Leu Asp Gin Leu 
530 535 540 

Phe Leu Ser Met Glu Asp Asn Gly Cys Thr Val Asp Ser Gly Met Leu 
545 550 555 560 

Asn Phe He Val Arg Glu Leu Leu Gin Arg Gly Glu He Thr Arg Ala 
565 570 575 

Gly Thr Tyr Leu Ser Met He Asp Glu Lys His Phe Ser Leu Glu Ala 
580 585 590 

Ser Thr Ala Ser Leu Phe He Asp Leu Leu Ser Gly Gly Lys Tyr Gin 
595 600 605 

Glu Tyr His He Phe Leu Pro Glu Lys Tyr Lys Ser Phe He Glu Ser 
610 615 620 

Leu Ser Cys 
625 
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<210> 36 

<211> 1554 

<212> DNA 

<213> Oryza sativa 



<400> 36 

atggcgagaa gaggacgacg atactgtaga 
gctccggtgg ctgggaggtg gcgacgtcgg 
gagagccccg agctgcgacg gcatcacgcc 
gcaaagccgg tctacttcgc gtcgaggcgt 
ctcgtccggc ccaccccaat ctgggccgat 
ccgatctggg ccgtccatcc gcgccgccca 
tgccaggtcg gtgaccctcc gccgccggcg 
cgtgtcacca cccttacccg cgcccgcacc 
cagggtggta cgacccaaga cctagggcgc 
cacgtgctcg acgaattgcc gctacggggc 
accctcaccg acgtcgcgcg tgacagccca 
gcccgagccg gcgccgacga ggtaactccc 
tgctgctgcc gcgcgggccg cttggacctc 
aagggattta gagtggaagc catcaccttc 
aagaggacga gcgacgcaat ggacatagtg 
ccagatgttt tctcctgcac cattcttctc 
gaagctctcg agctgctgca catgatggct 
gtggtgtcgt ataccactgt catcaatggc 
tacagtacat accatgaaat gcttgatcgg 
tctattattg ctgcgttatg caaggctcaa 
accatggtta agaatggtgt catgcctgat 
tattgctctt cagggcagcc aaaagaggct 
ggcgtcgaac caaatgttgt tacttataga 
agatgcaccg aagctagaaa gattttcgat 
attgctacct atcgtaccct gcttcagggg 
catgctctct tggatttgat ggatcctgag 



gcagagggaa cagaggagcg gggcactggt 60 
cggccgaatg tgttcccgag cgccgcgctg 120 
gactaccggc cgtgggcggc gcacatggag 180 
gcctccgggc ggcccgagct gcagcagcag 240 
tgggccgatc tcagcctccc ggagcggaga 300 
gccaatcgga cggtgggtgt attactgtac 360 
gcggcggcgg cggcggcagg catggcgcgc 420 
cgcgcccgcg gcggcggcgt ccccagcgcg 480 
gcggggggca gtggcaccga gggcgcacgc 540 
tggggcgcct cgatctacag cttcaaccgc 600 
gccgcagcag tttcgctctt caaccgcatg 660 
gacttgtgca cctacagcat tctcatcggt 720 
ggtttcgcgg ccttgggcaa tgtcattaag 780 
gctcctctgc tcaagggcct ctgtgccgac 840 
ctccgcagaa tgaccgagct cagctgcatg 900 
aagggtctgt gtgatgagaa cagaagccaa 960 
gatgatcgag gaggaggtag cccacctgat 1020 
ttcttcaaag agggggattc agacaaagct 1080 
aggatttcac caaatgttgt gacctacagc 1140 
gctatggaca aagccatgga ggtacttaac 1200 
tgcatgacat ataatagtat tctgcatgga 1260 
attggaacac tcaaaaagat gcgcagtgat 1320 
tcactgatga attatctttg caagaatgga 1380 
tctatgacca agaggggcct agagcctgat 1440 
tatgctacca aaggagccct tgttgagatg 1500 
ttctacaagt atttggagaa gtga 1554 



<210> 37 
<211> 517 
<212> PRT 

<213> Oryza sativa 
<400> 37 

Met Ala Arg Arg Gly Arg Arg Tyr Cys Arg Ala Glu Gly Thr Glu Glu 
15 10 15 

Arg Gly Thr Gly Ala Pro Val Ala Gly Arg Trp Arg Arg Arg Arg Pro 
20 25 30 
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Asn Val Phe Pro Ser Ala Ala Leu Glu Ser Pro Glu Leu Arg Arg His 
35 40 45 

His Ala Asp Tyr Arg Pro Trp Ala Ala His Met Glu Ala Lys Pro Val 
50 55 60 

Tyr Phe Ala Ser Arg Arg Ala Ser Gly Arg Pro Glu Leu Gin Gin Gin 
65 70 75 80 

Leu Val Arg Pro Thr Pro lie Trp Ala Asp Trp Ala Asp Leu Ser Leu 
85 90 95 

Pro Glu Arg Arg Pro lie Trp Ala Val His Pro Arg Arg Pro Ala Asn 
100 105 110 

Arg Thr Val Gly Val Leu Leu Tyr Cys Gin Val Gly Asp Pro Pro Pro 
115 120 125 

Pro Ala Ala Ala Ala Ala Ala Ala Gly Met Ala Arg Arg Val Thr Thr 
130 135 140 

Leu Thr Arg Ala Arg Thr Arg Ala Arg Gly Gly Gly Val Pro Ser Ala 
145 150 155 160 

Gin Gly Gly Thr Thr Gin Asp Leu Gly Arg Ala Gly Gly Ser Gly Thr 
165 170 175 

Glu Gly Ala Arg His Val Leu Asp Glu Leu Pro Leu Arg Gly Trp Gly 
180 185 190 

Ala Ser lie Tyr Ser Phe Asn Arg Thr Leu Thr Asp Val Ala Arg Asp 
195 200 205 

Ser Pro Ala Ala Ala Val Ser Leu Phe Asn Arg Met Ala Arg Ala Gly 
210 215 220 

Ala Asp Glu Val Thr Pro Asp Leu Cys Thr Tyr Ser lie Leu lie Gly 

225 230 235 240 

Cys Cys Cys Arg Ala Gly Arg Leu Asp Leu Gly Phe Ala Ala Leu Gly 
245 250 255 

Asn Val lie Lys Lys Gly Phe Arg Val Glu Ala lie Thr Phe Ala Pro 
260 265 270 

Leu Leu Lys Gly Leu Cys Ala Asp Lys Arg Thr Ser Asp Ala Met Asp 
275 280 285 
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lie Val Leu Arg Arg Met Thr Glu Leu Ser Cys Met Pro Asp Val Phe 
290 295 300 

Ser Cys Thr lie Leu Leu Lys Gly Leu Cys Asp Glu Asn Arg Ser Gin 
305 310 315 320 

Glu Ala Leu Glu Leu Leu His Met Met Ala Asp Asp Arg Gly Gly Gly 
325 330 335 

Ser Pro Pro Asp Val Val Ser Tyr Thr Thr Val lie Asn Gly Phe Phe 
340 345 350 

Lys Glu Gly Asp Ser Asp Lys Ala Tyr Ser Thr Tyr His Glu Met Leu 
355 360 365 

Asp Arg Arg lie Ser Pro Asn Val Val Thr Tyr Ser Ser He He Ala 
370 375 380 

Ala Leu Cys Lys Ala Gin Ala Met Asp Lys Ala Met Glu Val Leu Asn 
385 390 395 400 

Thr Met Val Lys Asn Gly Val Met Pro Asp Cys Met Thr Tyr Asn Ser 
405 410 415 

He Leu His Gly Tyr Cys Ser Ser Gly Gin Pro Lys Glu Ala He Gly 
420 425 430 

Thr Leu Lys Lys Met Arg Ser Asp Gly Val Glu Pro Asn Val Val Thr 
435 440 445 

Tyr Arg Ser Leu Met Asn Tyr Leu Cys Lys Asn Gly Arg Cys Thr Glu 
450 455 460 

Ala Arg Lys He Phe Asp Ser Met Thr Lys Arg Gly Leu Glu Pro Asp 
465 470 475 480 

He Ala Thr Tyr Arg Thr Leu Leu Gin Gly Tyr Ala Thr Lys Gly Ala 
485 490 495 

Leu Val Glu Met His Ala Leu Leu Asp Leu Met Asp Pro Glu Phe Tyr 
500 505 510 

Lys Tyr Leu Glu Lys 
515 



<210> 38 
<211> 2784 



47 



WO 03/057859 



PCT/US03/00887 



<212> DNA 

<213> Oryza sativa 

<400> 38 

atgcccttcc gaccgcgcct cccgctcccc 
cgccgccgct cctccccccg cccgcccgtg 
tcggcggcgg ccgcggcggc ggaggtgacg 
agggacaccc gagcccctcc ctccatcggc 
tgcaatggcg ggggggctgc cgatgacgag 
cggatcaagc tctgccatga gcttctgcgg 
ttggcgcagc tggtgactga gcaaggtgag 
caactctact catgtggtat ctgaataact 
ggcatcggta gttttgaact tctgatcgat 
tccagaggca ttatcaattg atcattacca 
caatgcatac atagttgtat ttggtgtagt 
ggtctctgca agtatgacaa actagtaaca 
cttcctgctg taggatctgg gtctgcagct 
agagagtgtg attccaacgg ttgtgtatgg 
cagatggttc atgatgccct ttacgttctt 
tcggtgttca cctatgacag tttattgcac 
ctttttgaag aaatggagtc ttgtggtgtc 
attaatggcc tctgtaagca agataaggtt 
aggaaggagg gaaagtttaa acccttggga 
tgtaattggg ggtttgttca gtctgcaaaa 
ttagtccctg acaggtatac cttttctacc 
atggaggaag cattggatct tttcgagaga 
gtgacctaca atagccttat caatgggtac 
aaaatcatcc agatgatgag aggccaaggt 
cttattgctg gtcactgcga aagtggtgat 
gtcctagacc aaggtttgca gttgaatatt 
ttcaaaaaag gcatgttctg cgaaattgac 
ttggatatgg atgttatcgc atattccatc 
attgaaaagg ctcttcaagt atgtaatgca 
tcactgaacc atttttctat tcttctagga 
aggtggtatt tggaaaatgt agctagaaaa 
gtcgttattg atggttatgc aaaacttggt 
cagatcactg tagctggtat gcacccaacc 
tattgtaaaa ttggggatct gcaacttgcc 
ggacttctac caacagcagt gacatacact 
gaagttaata ccatgctaag tctttttgat 
gcagtaactt acagtgtcat tgttaaaggg 
atcaatgttc tcaaagatat ggatagcaaa 
acccttatac aaggtttctg tgaatcagaa 
atcatgttat gccgtggcct tgtgccgaca 
ctgtgtttga agggaaaagt tattcaagca 
ggcattaagt tgagaaaatt tgcgtacaca 
atgcctatca atgctgtttt gttagttggt 
attgaagatt tcagtgcagc aatcaatcga 



ctcctcctcc tcctcctgcc tcacctccgg 60 
cccgcgtgga gaccgctctc gtattatccc 120 
gagtccgagg aggacgcggc ggctgttggc 180 
gggattgcac ggggagcgcc tagggttggc 240 
gaggtcgaga ggaaggcccg cgctgtcgcg 300 
gagaggaggt ggcgcgcgat gcgggcagcc 360 
catgctatga attttcccca ttctgattat 420 
atggtgattg gtgtgaggag gcgtaggaat 480 
atgaatgtgt gacacaggat atattgtttt 540 
tataaaaaac agtaagaaaa gggtcgaaag 600 
attattactg taattcgttt tttactagaa 660 
taaaaattgt tcgcgtttaa tcttattgcg 720 
gctctctgtg acatcttatg gaacagattc 780 
gatgctclag cgaacagtta tgctagagct 840 
agtaaaatga gcagcctaaa catgcaaatc 900 
ggcttaagga tgacagacgt ggcattggag 960 
tctcccagtg aatattcgca tagtattatt 1020 
ggagaagctt tatctttcct tcaggaagct 1080 
atgaccttta acattcttat gtctgcattg 1140 
tcatttttat gcctgatgct gaaatatgga 1200 
cttatacacg gtctatgtaa agtaggttca 1260 
gtgacaaaag aaggaatgga acttgagatt 1320 
cgattgcttg gtttaacaaa agaaattcct 1380 
gttgaacctg atcttgttac atatactata 1440 
gttgaagaag gaatgaaggt aaggaaggat 1500 
gtcacatata gtgtccttct caatgctctc 1560 
aacctactcg gcgagatcta caatattggt 1620 
cttatccatg ggtattgcaa gctaggggaa 1680 
atgtgcagtt ctcagagggt aatgccaaca 174 0 
ctttgcaaga aaggattgtt agttgaagca 1800 
tatcagccaa ctgatgtagt gttctataat 1860 
gatattgtaa atgctgttcg tttgtatgat 1920 
attgtcacat gcaattctct tctatatggg 1980 
gagagctatt ttagggctat tcagctaagt 2040 
accttgatgg atgcactctc tgaagctgga 2100 
gaaatggttg caaagaggat caaggcaaat 2160 
ctttgtaagc agctcagatt tgatgaggct 2220 
ggtattaatg ctgacccgat aacttacaat 2280 
aacgttcaga tggctttcca catacatgac 2340 
cctgttactt ataacttgct tattaatgtg 2400 
gaaatacttt tggagtccct cagagaaaat 24 60 
acacttatca aagctcagtg cgcaaaagga 2520 
aagcttctag atgcaggatt tgaagcttct 2580 
ctttgcaaaa gacaatttgc caaagaagcc 2640 
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tttatgtttg tcccgattat gctatctgtt ggtatttacc cagatactca aatatattgt 2700 
gtgctaggca gagctctgca gaaaaatagt gagcttgtct atctacccat attaaatgca 2760 
cttgctgtta aaactggtat ttaa 2784 



<210> 39 
<211> 811 
<212> PRT 

<213> Oryza sativa 
<400> 39 

Met Pro Phe Arg Pro Arg Leu Pro Leu Pro Leu Leu Leu Leu Leu Leu 
15 10 15 

Pro His Leu Arg Arg Arg Arg Ser Ser Pro Arg Pro Pro Val Pro Ala 
20 25 30 

Trp Arg Pro Leu Ser Tyr Tyr Pro Ser Ala Ala Ala Ala Ala Ala Glu 
35 40 45 

Val Thr Glu Ser Glu Glu Asp Ala Ala Ala Val Gly Arg Asp Thr Arg 
50 55 60 

Ala Pro Pro Ser lie Gly Gly lie Ala Arg Gly Ala Pro Arg Val Gly 
65 70 75 80 

Cys Asn Gly Gly Gly Ala Ala Asp Asp Glu Glu Val Glu Arg Lys Ala 
85 90 95 

Arg Ala Val Ala Arg lie Lys Leu Cys His Glu Leu Leu Arg Glu Arg 
100 105 110 

Arg Trp Arg Ala Met Arg Ala Ala Leu Ala Gin Leu Val Thr Glu Gin 
115 120 125 

Gly Ser Gly Ser Ala Ala Ala Leu Cys Asp lie Leu Trp Asn Arg Phe 
130 135 140 

Arg Glu Cys Asp Ser Asn Gly Cys Val Trp Asp Ala Leu Ala Asn Ser 
145 150 155 160 

Tyr Ala Arg Ala Gin Met Val His Asp Ala Leu Tyr Val Leu Ser Lys 
165 170 175 

Met Ser Ser Leu Asn Met Gin He Ser Val Phe Thr Tyr Asp Ser Leu 
180 185 190 

Leu His Gly Leu Arg Met Thr Asp Val Ala Leu Glu Leu Phe Glu Glu 
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195 200 205 

Met Glu Ser Cys Gly Val Ser Pro Ser Glu Tyr Ser His Ser lie lie 
210 215 220 

lie Asn Gly Leu Cys Lys Gin Asp Lys Val Gly Glu Ala Leu Ser Phe 

225 230 235 240 

Leu Gin Glu Ala Arg Lys Glu Gly Lys Phe Lys Pro Leu Gly Met Thr 
245 250 255 

Phe Asn lie Leu Met Ser Ala Leu Cys Asn Trp Gly Phe Val Gin Ser 
260 265 270 

Ala Lys Ser Phe Leu Cys Leu Met Leu Lys Tyr Gly Leu Val Pro Asp 
275 280 285 

Arg Tyr Thr Phe Ser Thr Leu lie His Gly Leu Cys Lys Val Gly Ser 
290 295 300 

Met Glu Glu Ala Leu Asp Leu Phe Glu Arg Val Thr Lys Glu Gly Met 
305 310 315 320 

Glu Leu Glu lie Val Thr Tyr Asn Ser Leu lie Asn Gly Tyr Arg Leu 
325 330 335 

Leu Gly Leu Thr Lys Glu lie Pro Lys lie lie Gin Met Met Arg Gly 
340 345 350 

Gin Gly Val Glu Pro Asp Leu Val Thr Tyr Thr lie Leu He Ala Gly 
355 360 365 

His Cys Glu Ser Gly Asp Val Glu Glu Gly Met Lys Val Arg Lys Asp 
370 375 * 380 

Val Leu Asp Gin Gly Leu Gin Leu Asn He Val Thr Tyr Ser Val Leu 
385 390 395 400 

Leu Asn Ala Leu Phe Lys Lys Gly Met Phe Cys Glu He Asp Asn Leu 
405 410 415 

Leu Gly Glu He Tyr Asn He Gly Leu Asp Met Asp Val He Ala Tyr 
420 425 430 

Ser He Leu He His Gly Tyr Cys Lys Leu Gly Glu He Glu Lys Ala 
435 440 445 

Leu Gin Val Cys Asn Ala Met Cys Ser Ser Gin Arg Val Met Pro Thr 
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450 455 460 

Ser Leu Asn His Phe Ser lie Leu Leu Gly Leu Cys Lys Lys Gly Leu 
465 470 475 480 

Leu Val Glu Ala Arg Trp Tyr Leu Glu Asn Val Ala Arg Lys Tyr Gin 
485 490 495 

Pro Thr Asp Val Val Phe Tyr Asn Val Val He Asp Gly Tyr Ala Lys 
500 505 510 

Leu Gly Asp He Val Asn Ala Val Arg Leu Tyr Asp Gin He Thr Val 
515 520 525 

Ala Gly Met His Pro Thr He Val Thr Cys Asn Ser Leu Leu Tyr Gly 
530 535 540 

Tyr Cys Lys lie Gly Asp Leu Gin Leu Ala Glu Ser Tyr Phe Arg Ala 
545 550 555 560 

He Gin Leu Ser Gly Leu Leu Pro Thr Ala Val Thr Tyr Thr Thr Leu 
565 570 575 

Met Asp Ala Leu Ser Glu Ala Gly Glu Val Asn Thr Met Leu Ser Leu 
580 585 590 

Phe Asp Glu Met Val Ala Lys Arg lie Lys Ala Asn Ala Val Thr Tyr 
595 600 605 

Ser Val He Val Lys Gly Leu Cys Lys Gin Leu Arg Phe Asp Glu Ala 
610 615 620 

He Asn Val Leu Lys Asp Met Asp Ser Lys Gly He Asn Ala Asp Pro 
625 630 635 640 

He Thr Tyr Asn Thr Leu He Gin Gly Phe Cys Glu Ser Glu Asn Val 
645 650 655 

Gin Met Ala Phe His He His Asp He Met Leu Cys Arg Gly Leu Val 
660 665 670 

Pro Thr Pro Val Thr Tyr Asn Leu Leu He Asn Val Leu Cys Leu Lys ■ 
675 680 685 

Gly Lys Val He Gin Ala Glu He Leu Leu Glu Ser Leu Arg Glu Asn 
690 695 700 

Gly He Lys Leu Arg Lys Phe Ala Tyr Thr Thr Leu He Lys Ala Gin 
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705 710 715 720 

Cys Ala Lys Gly Met Pro lie Asn Ala Val Leu Leu Val Gly Lys Leu 
725 730 735 

Leu Asp Ala Gly Phe Glu Ala Ser lie Glu Asp Phe Ser Ala Ala lie 
740 745 750 

Asn Arg Leu Cys Lys Arg Gin Phe Ala Lys Glu Ala Phe Met Phe Val 
755 760 765 

Pro lie Met Leu Ser Val Gly lie Tyr Pro Asp Thr Gin lie Tyr Cys 
770 775 780 

Val Leu Gly Arg Ala Leu Gin Lys Asn Ser Glu Leu Val Tyr Leu Pro 
785 790 795 800 

lie Leu Asn Ala Leu Ala Val Lys Thr Gly lie 
805 810 



<210> 40 
<211> 1779 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 40 

atgaaggctt tgagattgat tcagcctcat ctcttgaaga caggtagtct tagaactgat 60 

ttgctctgta ccatttcgag tttcttttct agctgcgaac gagacttttc aagtattagc 120 

aatgggaatg tctgtttcag agagagattg agaagtggta ttgttgatat taagaaagat 180 

gatgctattg ctctgttcca agaaatgatt aggtctcgtc ctcttcctag tcttgttgat 240 

ttcagtagat tctttagtgc cattgccaga acaaaacagt tcaatctcgt gttagatttc 300 

tgcaagcaac tggaattgaa tgggattgct cataacatct acactttgaa tatcatgatc 360 

aactgctttt gccggtgttg taaaacttgt tttgcttatt ctgttttggg aaaagtaatg 420 

aagcttgggt atgagcctga cacaaccacg tttaacactc tgatcaaagg actctttctt 480 

gagggtaaag tgtctgaagc tgtggtttta gtcgatagga tggtggaaaa cggatgtcaa 540 

cctgatgtgg ttacttataa ttcgattgta aatgggatat gtagatcagg agatacttct 600 

ttggccttgg atttgctcag aaagatggaa gaaagaaatg ttaaggctga tgtgtttact 660 

tacagtacaa tcattgatag tctttgtaga gatggttgca tagacgctgc aattagcctt 720 

ttcaaggaaa tggagacgaa agggattaaa tctagtgttg ttacgtataa ttctcttgtg 780 

agaggtcttt gtaaagccgg taaatggaat gatggggcac tgttgttgaa ggatatggtg 840 

agtagggaaa tcgtccctaa tgtcatcact ttcaatgtat tacttgatgt ttttgtcaaa 900 

gaagggaagc ttcaggaggc taatgaattg tacaaagaga tgatcacaag aggtatatca 960 

cctaatatta ttacttataa taccttgatg gatgggtatt gtatgcagaa ccgtcttagt 1020 

gaggccaaca atatgttgga tcttatggtt aggaataagt gcagtcctga tatcgtgact 1080 

tttacaagtc tcatcaaagg atattgtatg gtgaaaagag ttgacgatgg tatgaaggtc 1140 

ttccgcaata tttctaagag aggcttggtt gccaatgcag ttacttatag cattcttgtc 1200 

caagggtttt gtcaatccgg gaaaataaag ctcgcagagg aacttttcca agaaatggtt 1260 
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tcacacggtg ttcttcctga tgttatgacg tatggtattt tgcttgatgg cttgtgtgac 1320 
aatgggaagc ttgaaaaggc attggaaatt tttgaggatt tacaaaagag taagatggat 1380 
cttggtattg ttatgtatac aaccatcatc gaggggatgt gcaagggtgg aaaagtggaa 1440 
gatgcctgga atttattctg tagcctacct tgtaaaggag tgaagcctaa tgttatgaca 1500 
tacaccgtga tgatttcagg attatgtaag aaagggtcac tgtctgaagc aaacatcttg 1560 
cttagaaaaa tggaggaaga tgggaatgcg ccaaatgatt gtacatacaa cacactaatc 1620 
cgggcacatc tccgagatgg tgacttaact gcatcagcta aacttattga agaaatgaag 1680 
agttgtgggt tctcagcaga tgcttccagt attaagatgg ttatcgatat gttattgagt 1740 
ggtgaattgg acaaaagctt tctagatatg ctttcgtaa 1779 



<210> 41 
<211> 592 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 41 
Met Lys Ala Leu 
1 

Leu Arg Thr Asp 
20 

Glu Arg Asp Phe 
35 

Arg Leu Arg Ser 
50 

Leu Phe Gin Glu 
65 

Phe Ser Arg Phe 



Val Leu Asp Phe 
100 



Arg Leu lie Gin 
5 

Leu Leu Cys Thr 



Ser Ser lie Ser 
40 

Gly lie Val Asp 
55 

Met lie Arg Ser 
70 

Phe Ser Ala lie 
85 

Cys Lys Gin Leu 



Pro His Leu Leu 
10 

lie Ser Ser Phe 
25 

Asn Gly Asn Val 



lie Lys Lys Asp 
60 

Arg Pro Leu Pro 
75 

Ala Arg Thr Lys 
90 

Glu Leu Asn Gly 
105 



Lys Thr Gly Ser 
15 

Phe Ser Ser Cys 
30 

Cys Phe Arg Glu 
45 

Asp Ala lie Ala 



Ser Leu Val Asp 
80 

Gin Phe Asn Leu 

95 

lie Ala His Asn 
110 



lie Tyr Thr Leu Asn lie Met He Asn Cys Phe Cys Arg Cys Cys Lys 

115 120 125 

Thr Cys Phe Ala Tyr Ser Val Leu Gly Lys Val Met Lys Leu Gly Tyr 

130 135 140 

Glu Pro Asp Thr Thr Thr Phe Asn Thr Leu He Lys Gly Leu Phe Leu 

145 150 155 160 

Glu Gly Lys Val Ser Glu Ala Val Val Leu Val Asp Arg Met Val Glu 
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165 170 175 

Asn Gly Cys Gin Pro Asp Val Val Thr Tyr Asn Ser lie Val Asn Gly 
180 185 190 

lie Cys Arg Ser Gly Asp Thr Ser Leu Ala Leu Asp Leu Leu Arg Lys 
195 200 205 

Met Glu Glu Arg Asn Val Lys Ala Asp Val Phe Thr Tyr Ser Thr lie 
210 215 220 

lie Asp Ser Leu Cys Arg Asp Gly Cys lie Asp Ala Ala lie Ser Leu 
225 230 235 240 

Phe Lys Glu Met Glu Thr Lys Gly lie Lys Ser Ser Val Val Thr Tyr 
245 250 255 

Asn Ser Leu Val Arg Gly Leu Cys Lys Ala Gly Lys Trp Asn Asp Gly 
260 265 270 

Ala Leu Leu Leu Lys Asp Met Val Ser Arg Glu lie Val Pro Asn Val 
275 280 285 

He Thr Phe Asn Val Leu Leu Asp Val Phe Val Lys Glu Gly Lys Leu 
290 295 300 

Gin Glu Ala Asn Glu Leu Tyr Lys Glu Met He Thr Arg Gly He Ser 
305 310 315 320 

Pro Asn lie He Thr Tyr Asn Thr Leu Met Asp Gly Tyr Cys Met Gin 
325 330 335 

Asn Arg Leu Ser Glu Ala Asn Asn Met Leu Asp Leu Met Val Arg Asn 
340 345 350 

Lys Cys Ser Pro Asp He Val Thr Phe Thr Ser Leu He Lys Gly Tyr 
355 360 365 

Cys Met Val Lys Arg Val Asp Asp Gly Met Lys Val Phe Arg Asn He 
370 375 380 

Ser Lys Arg Gly Leu Val Ala Asn Ala Val Thr Tyr Ser He Leu Val 
385 390 395 400 

Gin Gly Phe Cys Gin Ser Gly Lys He Lys Leu Ala Glu Glu Leu Phe 
405 410 415 

Gin Glu Met Val Ser His Gly Val Leu Pro Asp Val Met Thr Tyr Gly 
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420 

lie Leu Leu Asp 
435 

Glu lie Phe Glu 
450 

Met Tyr Thr Thr 
465 

Asp Ala Trp Asn 



Asn Val Met Thr 
500 

Ser Leu Ser Glu 
515 

Asn Ala Pro Asn 
530 

Arg Asp Gly Asp 
545 

Ser Cys Gly Phe 



Met Leu Leu Ser 
580 



Gly Leu Cys Asp 
440 

Asp Leu Gin Lys 
455 

lie He Glu Gly 
470 

Leu Phe Cys Ser 
485 

Tyr Thr Val Met 



Ala Asn He Leu 

520 

Asp Cys Thr Tyr 
535 

Leu Thr Ala Ser 
550 

Ser Ala Asp Ala 
565 

Gly Glu Leu Asp 



425 

Asn Gly Lys Leu 



Ser Lys Met Asp 
4 60 

Met Cys Lys Gly 
475 

Leu Pro Cys Lys 
490 

He Ser Gly Leu 
505 

Leu Arg Lys Met 



Asn Thr Leu He 

540 

Ala Lys Leu He 
555 

Ser Ser lie Lys 
570 

Lys Ser Phe Leu 
585 



430 

Glu Lys Ala Leu 
445 

Leu Gly He Val 



Gly Lys Val Glu 
480 

Gly Val Lys Pro 
495 

Cys Lys Lys Gly 
510 

Glu Glu Asp Gly 
525 

Arg Ala His Leu 



Glu Glu Met Lys 
560 

Met Val He Asp 
575 

Asp Met Leu Ser 
590 



<210> 42 
<211> 6711 
<212> DNA 
<213> Petunia sp . 

<400> 42 

atatatatat acaaactgat 
tgaaaaaggg tagctccgct 
cataatatat acaaaactca 
ttgtaatata gctattaggt 
tcggccattt ggctaaaaat 
ccagagcgac agtatacttc 



tttttctgtc tatttgcaca 
aataatgtta tctttacaaa 
ttcttatgta ttgtaaatat 
agtcatgttg tgtaaatttt 
atttcatttt atagtcgcat 
aatggtatac gatatcttct 



gtgttatttt tacataccct 60 
aaataacaat acttttttta 120 
gataaaaata ttgttatttt 180 
cctaaaaata tttacctgag 24 0 
atactccaag cttgtatatc 300 
attatacacc tttttagtat 360 
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tcgtataccc ccaatagtat acaagtttga 
tgtgtggcta caaaatttga gtgatggtag 
tgaaaacatt ctttttgaaa attgtagttc 
tatgtttctt gattctggct ggtcttctaa 
agcgaggggc aaataggcta catggccaaa 
gaaagtattc gaaaagattg tagggacaag 
gaaaaatgca cgaagaaatt caaaagcaaa 
agtgctgcct tagggagtga gaaatgggca 
ttattcatta taaacttaag gaaaagtgca 
ggaactttaa tcattatggg ctgaacttca 
aagtaaaaat ttagctcaca ggccactttt 
tattggtttt acatgagaga ccactttttg 
tgcctaagtt attaagttca acttcaatcc 
taaacttcaa ccttgtggac tgaagttgaa 
cttcaatcct tgtgggctga acttgtgtga 
aaatgacctc tcaagcaaaa tctgtaaaaa 
cgcaaagtag gctgaagttc agcccacaat 
tacctcctta tctcggttat gatcttttgt 
acgaaagacc acttttgcca ttggtctttg 
tttgtggcta cttttgctta aagagttgaa 
tccttaaaga ttgaacttcg atccagtgcc 
gatggaactt cagtccagag ccatatggac 
caatatttat ttagtaaagg cccaaaagtg 
taaaactaaa taacagtgtt aaaagtggct 
aggtagcaat tcaacttcaa cctatggtgt 
actaagtgaa gtgaagcgaa gatgatgaga 
ccctttttct cattctttgc ttattcaatt 
tccatttcag ttaaagggaa ttttggggtt 
gatgatgctt tcagtttgtt ccgtcaaatg 
tctttctcta aattgttgaa agctatggta 
ctttttcgag aaatccacaa attacgaatt 
gttaacagtt gttgccttat gcatcgtacc 
ttcaagaaag gcattccata taatgaagtc 
gctgaaaata aggtcaaaga tgctgttcat 
tgtgagccta atgaagtcat gtatggaacg 
actcaaaaag cttttgattt gctccggtta 
cgcacttaca ccattgtcat agacgccttt 
agccttttga atgagatgaa acaaaaaagc 
ttaattgatg ctttgtgtaa gttaagtcag 
atgatacatc ttaatattta tccaaatgtg 
tgcaaagagg ggaaagtaga agacgctgag 
gtagaccctg atgtgatcac ctataatatg 
gtggatagag cacgggaaat ttttgattcc 
attagctata atatactaat aaatggatat 
caagtctgcc gtgaaatttc tcaaaaggga 
ctcttgcatg gtctttttga acttggaaga 
atgctatctg cggggcacat ccctgattta 
tttaagaatg gacttgttga agaggctatg 



caccgcaata gtgtacccca atgttgtggt 420 
tgtaattttt tagtgtaagc tgggtagttt 480 
aagtcatagt acaaaaaact gaaatattta 540 
aaattttgaa atgctggcta gttttcattt 600 
tttttacgtt aaaagataag tgttgtctgg 660 
tgttgcctag acaacacgtc aaattatgta 720 
tattgcttaa gcaagaggca gtcaaagaca 780 
tcactataag attgtatttc catccgatat 840 
ggaaaaacca ctagttttta gctatttttt 900 
gactttgtgg gccgaacttc atacattcgc 960 
accactagta tttggtttga agtcattttt 1020 
gaacttcaat ctttgtgcgc ttgaacttca 1080 
gtaagggctg aatttttagg catagatgcg 1140 
cttcgcccct tatggtggcc tgaagttgaa 1200 
agttcaaccc acaaggatta aagtttcaaa 1260 
aaagtggtct ctcatgcact tttacccatt 1320 
tattcaagtt ccaaaaaatt tcacaatata 1380 
atgatttagc aaaatggacg gggaaagtgc 1440 
ggtacaggcc actaatacca aaatatttag 1500 
cttcagtcca gaggccggat tgaagttcag 1560 
atatggactg aagttcagtc aagtccttaa 1620 
tgaagttcag tcaattatca gaacttaagt 1680 
gttagtataa gaccaataaa aatagcggcc 174 0 
gatggacgaa atttctacaa aatggactcg 1800 
catagttgta caattcttcc aatcacccct 1860 
atttcagtgc gttactgtct caatggtaat 1920 
gcaccccgac attattctac caatacatgt 1980 
tctaatgaat ttcagaatgt taagtgttta 2040 
gttagaacta agcctcttcc ttctgttgcc 2100 
catatgaagc attactcttc tgttgtttct 2160 
cctgttcatg aattcatctt gagcattgtg 2220 
gatctcggat tttctgtatt agccattcac 2280 
acctttacta ccttaataag gggacttttt 2340 
ttgttcaaaa agttggtgag ggagaatata 24 00 
gtcatgaatg ggctttgcaa aaagggccat 24 60 
atggaacaag gaagcactaa gcccaataca 2520 
tgcaaagatg ggatgctaga tggtgctacc 2580 
attcctcccg acatttttac ttatagcact 2640 
tgggaaaatg ttaggacttt gttccttgag 2700 
tgcaccttca actccgtcat tgatggacta 2760 
gaaataatga gatacatgat tgaaaaaggt 2820 
ataattgacg gatatggctt gcgtggtcaa 2880 
atgatcaata agagcattga gcccgatatt 2940 
gccaggcaaa agaaaataga cgaggcaatg 3000 
ttgaaaccta gtattgttac ctgcaatgtt 3060 
actaaatctg cacaaaattt ctttgatgag 3120 
tacactcatt gtactttgct tggtggttat 3180 
tcacacttcc ataagttgga aagaaggaga 3240 
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gaagatacaa atattcaaat ttacacggct 
ctcgacaaag ctcatgctac gtttgagaag 
ataacataca ctgcaatgat tagtggatat 
gatatgctaa ggaaaatgga ggacaatggt 
attgtgcggg gatttctcag aagcaataaa 
atagctggga agagcttctc atttgaggca 
gcagaggatc catccataac acgcaaaatg 
tacaaggaga ttagcagaat aatcacaggt 
acaatcgaaa caaggtaaac tattaatact 
ccaattgaaa caggtaatat atatatttcc 
gtcatccttg tgcagaggcc atgctaatct 
tgatatttta gattatatac cccacagttc 
tgcatctgtc ataaaaggga ttgctcctat 
gtcggataaa tgagcaaata gtacatgttt 
tctacgggaa attatagtta tctatgtggt 
tatagattga ctgttttcct ctgtcattga 
gtatatgtgt acaatgcttt taacaactgg 
tcaacgaagt ctctttgtcg agatgcagct 
ggaaatgtga ttatcccatt caaagatact 
cagaagaata ttcagaaaag gcatgtagat 
gtgaggagga caattataca ttgtcctcca 
gggatctaaa ggacatgttc tgcatgtagc 
ggatttggta gactatacca gccttcattt 
atgtaaagga aactctatgg ttgtacacct 
ggtgtatatt agtatatacg gtagaattca 
cttggtttta gttcattaag gcataaatgt 
attcatttga tctatgatga tggagtcttg 
tacaatgggt tgtgtagtcc aatatgttaa 
gagattttgt gatatttatt cgtaaaatgc 
aagttttctt tttagtttgg tggacaacaa 
tcttattctt tggaataaaa tattgagctt 
atttacctta ctatacctat taaaaattac 
actgcgtatt tttttttata ctataagtct 
tacttaccca catcgggtgt atatacccaa 
atgaaattta aagtatttac aagtcatacc 
aagtatttac ccaacatacc cattttttat 
taaggctgca taaattatgg tataaaagtc 
tataccttat aatatataag tctggagggg 
cagataatat ttgaaagtag tggatagtgg 
ttgtaagtgt ctaatcaaca aatgcacgtc 
ccataattaa ttaatagaca ttctctcttc 
tggttaggta tatatatccg gtgtgagtaa 
taaatatgaa acttacatgc agaggcagat 
aaaacaatga ttaaattatc gcgcaaaacc 
tgtttgttat cattccataa tgaaaattac 
gtggctattg tgtcaagaat cataatcgtg 
cttttgtggc tatttagtca aaaatagtaa 
gccaactccg gccctacgtt catcaagtac 



gtcattgatg gattgtgcaa aaatggtaag 3300 
cttcccttga taggcttaca tcctgatgtg 3360 
tgtcaagaag ggttgttaga tgaagctaaa 3420 
tgtttggcag acaaccgaac atacaatgtt 3480 
gttagtgaaa tgaaggcttt tctggaggaa 3540 
gctactgtag agttattgat ggatattata 3600 
cactggatta aactgcacat tgcatgaata 3660 
ccgtcccaga caaccccaag gctaaatccc 3720 
ttaactgcca aaacttcttt aagaactatg 3780 
tttatttgga acatttctcg atttatgcgt 3840 
tctctcaatc gttcccaggt ttgatttgaa 3900 
tgcattgaaa tatgcaccca aacaaattag 3960 
tataccatca ttaagaaaat ccttgtgaca 4020 
gtttattttt tattcaataa gagtttgaca 4080 
cgtactttta aagaaaagta tttttgtggt 4140 
tcgactttct tttattcaca tcagaagtag 4200 
tttgcatgtg ctctttatgg tcgcatttca 4260 
tgacttgtta agacaagaac ctttcgattt 4320 
tgacagatgt ctcatgatgc ttatttgact 4380 
gtgcagcaaa tgacagagta tgtgcaatgg 4440 
ttgttcagtg gcaaatggca atacacctat 4500 
tagaagggat gcaagttcac agggaactag 4560 
tatcagttag tgaaatgaaa gaaaccatca 4 620 
tttgaagttc caagtgttaa actaacctct 4 680 
ttcaattgca caagtagatg tatctttttg 4740 
tctacttagg tttcatgcat taaaatgaac 4800 
gtcgtgcata tacatgcttc aaaattattg 4860 
catcatccac gacattatct ctaatagttt 4920 
atgttaagat tattgtaatt tagacttcta 4980 
agtaataaat ctcaaacaca ttgtttggtc 5040 
tttacaatgt gtacccttgg aatataaagt 5100 
attactcatg aaattcaaag tatctatcac 5160 
atatttacct tagggcaaaa attaggcaag 5220 
tcaacaaagg aatttttaca ctctataccc 5280 
cattaaaaat tactctaccc ataaaactaa 5340 
tgtatacctt tgtttatagg tgtatacctt 5400 
tggaggacca tttattattg tttacctttt 5460 
tagagtgagt aaatatttta atgggtaggc 5520 
ctgtaattat tttagatttc gtgggtattt 5580 
atttgtttac aatacactac tatcacttag 5640 
attacatcac attaccatag ttaattgcta 5700 
attttttcat ataaattatg gcaagacgag 5760 
aaatattttg attttgatga gtattttcgt 5820 
ctttcagttt gttttaatcg tgtacttatt 5880 
ctcattagtg ccacattatc tttctataat 5940 
ccaacttgct acattgtaaa aacaatgatt 6000 
ctctgctttc cattgtctcc ggtcacctcg 6060 
ttattttcca tatttattcg ttattttgtt 6120 
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aatacttaca atttgtttaa ttaaatcata gaattagctg atacacacat atatagtgaa 6180 
aaatgagata gtaactgaag cagctcaagt tcaattttta ggtgcaaaat tcttctatca 6240 
gttattatgt tttgctttca aattaataac atattcatat agccgacctc aactaattac 6300 
gcattgatgc atagttcatt gtactaggaa aagtaaaatt tcatttttaa gttagtttat 6360 
ttgagcaagt tatatatata tacacaatgc atgtgcttat atccctttcc aatgctaact 6420 
ctgacttcat gaaaattaaa ttataggtgt tactttagtg agggacgcga attaatatta 6480 
catcactggt agtggcggag ccagtatttt tactaaggag tatcaaaata taaataagta 6540 
aatatacgaa atattaaaag gatagtgaat cttccttttt aatgtacttc attttaaagt 6600 
tagtttattt gagaaagtta tatatataca atgtatgaac tgatattctt tgataatgat 6660 
gatgcctatg tggatagtga atcttccttt ttaatgttga tgaaaaataa a 6711 



<210> 43 
<211> 591 
<212> PRT 
<213> Petunia sj 

<400> 43 
Met Met Arg lie 
1 

Ser Phe Phe Ala 
20 

Cys Ser lie Ser 
35 

Asn Val Lys Cys 
50 

Arg Thr Lys Pro 
65 

Ala Met Val His 



Glu lie His Lys 
100 

Val Val Asn Ser 
115 

Val Leu Ala lie 
130 

Phe Thr Thr Leu 
145 



Ser Val Arg Tyr 
5 

Tyr Ser lie Ala 



Val Lys Gly Asn 
40 

Leu Asp Asp Ala 
55 

Leu Pro Ser Val 
70 

Met Lys His Tyr 
85 

Leu Arg lie Pro 



Cys Cys Leu Met 
120 

His Phe Lys Lys 
135 

lie Arg Gly Leu 
150 



Cys Leu Asn Gly 
10 

Pro Arg His Tyr 
25 

Phe Gly Val Ser 



Phe Ser Leu Phe 
60 

Ala Ser Phe Ser 
75 

Ser Ser Val Val 
90 

Val His Glu Phe 
105 

His Arg Thr Asp 



Gly lie Pro Tyr 
140 

Phe Ala Glu Asn 
155 



Asn Pro Phe Phe 
15 

Ser Thr Asn Thr 
30 

Asn Glu Phe Gin 
45 

Arg Gin Met Val 



Lys Leu Leu Lys 
80 

Ser Leu Phe Arg 
95 

lie Leu Ser lie 
110 

Leu Gly Phe Ser 
125 

Asn Glu Val Thr 



Lys Val Lys Asp 
160 
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Ala Val His Leu Phe Lys Lys Leu Val Arg Glu Asn lie Cys Glu Pro 
165 170 175 

Asn Glu Val Met Tyr Gly Thr Val Met Asn Gly Leu Cys Lys Lys Gly 
180 185 190 

His Thr Gin Lys Ala Phe Asp Leu Leu Arg Leu Met Glu Gin Gly Ser 
195 200 205 

Thr Lys Pro Asn Thr Arg Thr Tyr Thr He Val He Asp Ala Phe Cys 
210 215 220 

Lys Asp Gly Met Leu Asp Gly Ala Thr Ser Leu Leu Asn Glu Met Lys 
225 230 235 240 

Gin Lys Ser He Pro Pro Asp He Phe Thr Tyr Ser Thr Leu He Asp 
245 250 255 

Ala Leu Cys Lys Leu Ser Gin Trp Glu Asn Val Arg Thr Leu Phe Leu 
260 265 270 

Glu Met He His Leu Asn He Tyr Pro Asn Val Cys Thr Phe Asn Ser 
275 280 285 

Val He Asp Gly Leu Cys Lys Glu Gly Lys Val Glu Asp Ala Glu Glu 
290 295 300 

He Met Arg Tyr Met He Glu Lys Gly Val Asp Pro Asp Val He Thr 
305 310 315 320 

Tyr Asn Met He He Asp Gly Tyr Gly Leu Arg Gly Gin Val Asp Arg 
325 330 335 

Ala Arg Glu He Phe Asp Ser Met He Asn Lys Ser He Glu Pro Asp 
340 345 350 

He He Ser Tyr Asn He Leu He Asn Gly Tyr Ala Arg Gin Lys Lys 
355 360 365 

He Asp Glu Ala Met Gin Val Cys Arg Glu lie Ser Gin Lys Gly Leu 
370 375 380 

Lys Pro Ser He Val Thr Cys Asn Val Leu Leu His Gly Leu Phe Glu 
385 390 395 400 

Leu Gly Arg Thr Lys Ser Ala Gin Asn Phe Phe Asp Glu Met Leu Ser 
405 410 415 
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Ala Gly His lie Pro Asp Leu Tyr Thr His Cys Thr Leu Leu Gly Gly 
420 425 430 

Tyr Phe Lys Asn Gly Leu Val Glu Glu Ala Met Ser His Phe His Lys 
435 440 445 

Leu Glu Arg Arg Arg Glu Asp Thr Asn lie Gin lie Tyr Thr Ala Val 
450 455 460 

lie Asp Gly Leu Cys Lys Asn Gly Lys Leu Asp Lys Ala His Ala Thr 
465 470 475 480 

Phe Glu Lys Leu Pro Leu lie Gly Leu His Pro Asp Val lie Thr Tyr 
485 490 495 

Thr Ala Met lie Ser Gly Tyr Cys Gin Glu Gly Leu Leu Asp Glu Ala 
500 505 510 

Lys Asp Met Leu Arg Lys Met Glu Asp Asn Gly Cys Leu Ala Asp Asn 
515 520 525 

Arg Thr Tyr Asn Val lie Val Arg Gly Phe Leu Arg Ser Asn Lys Val 
530 535 540 

Ser Glu Met Lys Ala Phe Leu Glu Glu lie Ala Gly Lys Ser Phe Ser 
545 550 555 560 

Phe Glu Ala Ala Thr Val Glu Leu Leu Met Asp lie lie Ala Glu Asp 
565 570 575 

Pro Ser lie Thr Arg Lys Met His Trp lie Lys Leu His He Ala 
580 585 590 



<210> 44 
<211> 1779 
<212> DNA 
<213> Petunia sp. 

<400> 44 

atgatgagaa ttgcagtgcg ttactgtctc 
tattcaattg caccccgaca ttattctacc 
tttggggttt ctaatgaatt tgagaatgtt 
cgtcaaatgg ttagaactaa gcctcttcct 
gctttggtac atatgaagca ttactcttct 
ttacgtattc ctgttcatga attcatcttg 
catcgtaccg atctcggatt ttctgtatta 
aatcaagtta tctttaacac cttactaagg 



aatggtaatc cctttttctc attctttgct 60 
aatacacgtt ccatttcagt taaagggaat 120 
aagtgtttag atgatgcttt cagtttgttc 180 
tctgttgtct ctttctctaa attgttgaaa 240 
gttgtttctc tttttcgaga aatccacaaa 300 
agcattgtgg ttaacagttg ttgccttatg 360 
gccattcact tcaagaaagg tattccattt 420 
ggactctttg ctgaaaataa ggttaaagat 480 
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gctgttcatt tgttcaaaaa gttggtgagg gagaatatat gtgagcctaa tgaagtcatg 540 

tatggaacgg tcatgaatgg gctttgcaaa aagggccata ctcaaaaagc ttttgatttg 600 

ctccggttaa tggaacaagg aagtactaag cccaatacat gtatctatag cattgttatc 660 

gatgcctttt gcaaagatgg gatgctagat ggtgctacca gccttttgaa tgagatgaaa 720 

caaaaaagca ttcctcccga catttttact tatagcactt taattgatgc tttgtgtaag 780 

ttaagtcagt gggaaaatgt taggactttg ttccttgaga tgatacatct taatatttat 840 

ccaaatgtgt gcaccttcaa ctccgtcatt gatggactat gcaaagaggg gaaagtagaa 900 

gacgctgagg aaataatgag atacatgatt gaaaaaggtg tagaccctga tgtgatcacc 960 

tataatatga taattgacgg atatggcttg cgtggtcaag tggatagagc acgggaaatt 1020 

tttgattcca tgatcaataa gagcattgag cccaatatta ttagctataa tatactaata 1080 

aatggatatg ccaggcaaaa gaaaatagac gaggcaatgc aagtctgccg tgaaatttct 1140 

caaaagggat tgaaacctag tattgttacc tgcaatgttc tcttgcatgg tctttttgaa 1200 

cttggaagaa ctaaatctgc acaaaatttc tttgatgaga tgctatctgc ggggcacata 1260 

cctgatttat acactcattg tactttgctt ggtggttatt ttaagaatgg acttgttgaa 1320 

gaggctatgt cacacttcca taagttggaa agaaggagag aagatacaaa tattcaaatt 1380 

tacacggctg tcattgatgg attgtgcaaa aatggtaagc tcgacaaagc tcatgctacg 1440 

tttgagaagc ttcccttgat aggcttacat cctgatgtga taacatacac tgcaatgatt 1500 

agtggatatt gtcaagaagg gttgttagat gaagctaaag atatgctaag gaaaatggag 1560 

gacaatggtt gtttggcaga caaccgaaca tacaatgtta ttgtgcgggg atttctcaga 1620 

agcaataaag ttagtgaaat gaaggctttt ctggaggaaa tagctgggaa gagcttctca 1680 

tttgaggcag ctactgtaga gttattgatg gatattatag cagaggatcc ttctttgctt 1740 

aacatgattc cagaatttca ccgggataat aagaagtga 1779 



<210> 45 

<211> 592 

<212> PRT 

<213> Petunia sp . 

<400> 45 

Met Met Arg lie Ala Val Arg Tyr Cys Leu Asn Gly Asn Pro Phe Phe 
15 10 15 

Ser Phe Phe Ala Tyr Ser lie Ala Pro Arg His Tyr Ser Thr Asn Thr 
20 25 30 

Arg Ser lie Ser Val Lys Gly Asn Phe Gly Val Ser Asn Glu Phe Glu 
35 40 45 

Asn Val Lys Cys Leu Asp Asp Ala Phe Ser Leu Phe Arg Gin Met Val 
50 55 60 

Arg Thr Lys Pro Leu Pro Ser Val Val Ser Phe Ser Lys Leu Leu Lys 
65 70 75 80 

Ala Leu Val His Met Lys His Tyr Ser Ser Val Val Ser Leu Phe Arg 
85 90 95 
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Glu lie His Lys Leu Arg lie Pro Val His Glu Phe lie Leu Ser lie 
100 105 110 

Val Val Asn Ser Cys Cys Leu Met His Arg Thr Asp Leu Gly Phe Ser 
115 120 125 

Val Leu Ala lie His Phe Lys Lys Gly lie Pro Phe Asn Gin Val lie 
130 135 140 

Phe Asn Thr Leu Leu Arg Gly Leu Phe Ala Glu Asn Lys Val Lys Asp 
145 150 155 160 

Ala Val His Leu Phe Lys Lys Leu Val Arg Glu Asn lie Cys Glu Pro 
165 170 175 

Asn Glu Val Met Tyr Gly Thr Val Met Asn Gly Leu Cys Lys Lys Gly 
180 185 190 

His Thr Gin Lys Ala Phe Asp Leu Leu Arg Leu Met Glu Gin Gly Ser 
195 200 205 

Thr Lys Pro Asn Thr Cys lie Tyr Ser lie Val lie Asp Ala Phe Cys 
210 215 220 

Lys Asp Gly Met Leu Asp Gly Ala Thr Ser Leu Leu Asn Glu Met Lys 
225 230 235 240 

Gin Lys Ser lie Pro Pro Asp lie Phe Thr Tyr Ser Thr Leu lie Asp 
245 250 255 

Ala Leu Cys Lys Leu Ser Gin Trp Glu Asn Val Arg Thr Leu Phe Leu 
260 265 270 

Glu Met lie His Leu Asn lie Tyr Pro Asn Val Cys Thr Phe Asn Ser 
275 280 285 

Val lie Asp Gly Leu Cys Lys Glu Gly Lys Val Glu Asp Ala Glu Glu 
290 295 300 

lie Met Arg Tyr Met lie Glu Lys Gly Val Asp Pro Asp Val lie Thr 
305 310 315 320 

Tyr Asn Met lie lie Asp Gly Tyr Gly Leu Arg Gly Gin Val Asp Arg 
325 330 335 

Ala Arg Glu lie Phe Asp Ser Met lie Asn Lys Ser lie Glu Pro Asn 
340 345 350 
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lie He Ser Tyr Asn He Leu He Asn Gly Tyr Ala Arg Gin Lys Lys 
355 360 365 

He Asp Glu Ala Met Gin Val Cys Arg Glu He Ser Gin Lys Gly Leu 
370 375 380 

Lys Pro Ser He Val Thr Cys Asn Val Leu Leu His Gly Leu Phe Glu 
385 390 395 400 

Leu Gly Arg Thr Lys Ser Ala Gin Asn Phe Phe Asp Glu Met Leu Ser 
405 410 415 

Ala Gly His He Pro Asp Leu Tyr Thr His Cys Thr Leu Leu Gly Gly 
420 425 430 

Tyr Phe Lys Asn Gly Leu Val Glu Glu Ala Met Ser His Phe His Lys 
435 440 445 

Leu Glu Arg Arg Arg Glu Asp Thr Asn He Gin He Tyr Thr Ala Val 
450 455 460 

lie Asp Gly Leu Cys Lys Asn Gly Lys Leu Asp Lys Ala His Ala Thr 
465 470 475 480 

Phe Glu Lys Leu Pro Leu He Gly Leu His Pro Asp Val Tie Thr Tyr 
485 490 495 

Thr Ala Met He Ser Gly Tyr Cys Gin Glu Gly Leu Leu Asp Glu Ala 
500 505 510 

Lys Asp Met Leu Arg Lys Met Glu Asp Asn Gly Cys Leu Ala Asp Asn 
515 520 525 

Arg Thr Tyr Asn Val He Val Arg Gly Phe Leu Arg Ser Asn Lys Val 
530 535 540 

Ser Glu Met Lys Ala Phe Leu Glu Glu He Ala Gly Lys Ser Phe Ser 
545 550 555 560 

Phe Glu Ala Ala Thr Val Glu Leu Leu Met Asp He He Ala Glu Asp 
565 570 575 

Pro Ser Leu Leu Asn Met He Pro Glu Phe His Arg Asp Asn Lys Lys 
580 585 590 
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<210> 46 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 46 

tgcacagtgt tatatttaca taccc 25 

<210> 47 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<210> 48 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 48 

tgaaaatgac aatcgtaaca gaaaa 25 

<210> 49 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 47 



tttatgatac atggatttca acgac 



25 



<400> 49 



aacattcctc cagacattat taca 



24 
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<210> 50 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 50 

gacgctgagg aaataatgag atac 24 

<210> 51 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 51 

tctagaaaaa atgaaggggg aatcaat 27 

<210> 52 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 

<400> 52 

gaattcactt tgctctcacg ataaactaag a 31 



65 



